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(54) Title: INTEGRASE COFACTOR 

(57) Abstract: In a study of HIV-1 integrase (IN) complexes derived from nuclei of human cells stably expressing the viral protein 
from a synthetic gene it was demonstrated that in the nuclear extracts IN exists as part of a large distinct complex with apparent 
Stokes radius of 61 A, which dissociates upon dilution yielding a core molecule of 41 A. The IN complexes were isolated from cells 
expressing FLAG-tagged IN. By present invention it was demonstrated that the 41 A core is tetramer of IN, whereas 61 A molecules 
are composed of IN tetramers associated with a cellular protein with an apparent molecular weight of 76 kDa. This integrase in- 
teracting protein (Inip76) was found to be identical to LEDGF/DFS70/p75 a protein implicated in regulation of gene expression 
and cellular stress-response. HIV-1 IN and Inip76 co-localized in the nuclei of human cells stably expressing IN. Furthermore, 
it has been demonstrated by present invention that recombinant Inip76 strongly promoted strand-transfer activity of HTV-1 IN in 
vitro. Our findings reveal that the minimal IN molecule in human cells is a tetramer and clearly demonstrates that Inip76 is plays a 
role in retroviral integration. Therefore the present invention provides integrase interacting proteins and more particularly cofactors 
which promote strand transfer activity of viral integrase, more particularly HIV integrase, and methods and uses relating thereto. 
The present invention relates to a cellular protein that associates with integrase ( integrase interacting protein-Inip), to molecules 
interacting with Inip and their use as an antiviral. The present invention also relates to antibodies, RNA interference, antigen therapy, 
gene silencing or antisense inhibition of said integrase interacting protein. The novel integrase interaction protein is a target forHTV 
replication prevention or inhibition. 
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FIELD OF THE INVENTION 



5 The present invention provides integrase interacting proteins and more particularly 
cofactors which promote strand transfer activity of viral integrase, more particularly HIV 
integrase, and methods and uses relating thereto. The present invention relates to a 
cellular protein that associates with integrase (integrase interacting protein - Inip), to 
molecules interacting with Inip and their use as an antiviral. The present invention also 
10 relates to antibodies, RNA interference, antigene therapy, gene silencing or antisense 
inhibition of said integrase interacting protein. The novel integrase interaction protein is a 
target for HIV replication prevention or inhibition. 

More particular, the present invention identifies the cellular protein, Inip76, found to be 
identical to LEDGF/DFS70/p75 as a protein that interacts with HIV-integrase. 
15 Coexpression of Inip76 specifically enhances, and depletion of Inip76 decreases retroviral 
genome integration into cellular chromosomal DNA. These results implicate Inip76 as a 
specific coactivator of retroviral integrase. 



20 BACKGROUND OF THE INVENTION 

Establishment of the provirus, a DNA copy of the viral genome integrated into the host 
cell chromosome is an obligatory step in retroviral replication. Stable integration into the 
human genome is one of the primary reasons for the persistence of the human 

25 immunodeficiency virus (HIV) infection, which leads to the acquired immunodeficiency 
syndrome (AIDS). Therefore, HIV integrase (IN), the enzyme orchestrating the insertion 
of the DNA replica of the viral genome into the cellular chromosomal DNA, is a potential 
target for antiretroviral therapy. The recent discovery of the new lead compounds able to 
protect cells against HIV infection by inhibiting integration raises hope that integrase 

30 inhibitors will be added to future combination cocktails (for reviews see Debyser et al., 
2002; Miller and Hazuda, 2001; Pani and Marongiu, 2000; Pommier et al., 2000). 
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Mechanistically ai^fc-ucturally, retroviral integrases (reviewe^^Asante-Appiah and 

Skalka, 1997; Brown et al., 1987; Craigie, 2001; Hindmarsh and Leis, 1999) are similar to 
the 'well-studied prokaryotic Mu and Tn5 transposases and belong to a family of DNA 
strand transferases that catalyze DNA strand cleavage/ligation via direct trans- 
5 esterification. In the course of viral integration, HIV IN performs two enzymatic 
reactions: first is the removal of the 3'- GT dinucleotides from the long terminal repeats 
(LTRs) (i.e. the 3 '-processing reaction). The second reaction is the insertion of the 
recessed 3'- viral DNA ends into the opposite strands of the target DNA wherein the 3' 
hydroxyls of the processed LTR ends attack two phosphodiester bonds in the target DNA 

10 molecule (the strand-transfer reaction). Insertion of the two viral LTRs takes place in a 
coordinated fashion across the major groove of the target DNA. As a result, the integrated 
provirus is flanked by two 5-nucleotide gaps as well as two mismatched 5 '-AC 
dinucleotides, which are probably repaired by cellular enzymes (Yoder and Bushman, 
2000). HIV IN, like other retroviral integrases, is comprised of three structural domains. 

15 The amino terminal domain contains the HHCC zinc finger motif, coordinates Zn and 
appears to be important for formation of the active IN multimers. The central, catalytic or 
"core" domain (residues 50-212) contains the active site (i.e. the DDE motif composed of 
D64, D116 and El 52) and is structurally very similar to the catalytic domains of Mu 
phage and Tn5 transposases. In the context of the full-length enzyme, the core domain is 

20 thought to make sequence-specific contacts with the DNA. The carboxy terminal 
fragment possesses unspecific DNA-binding activity and is critical for multimerization of 
the enzyme. All single domain fragments of IN are dimeric when produced separately; the 
full-length HIV IN as well as other retroviral integrases form multimers under the 
conditions of the in vitro enzymatic assays (Ellison et al, 1995; Engelman et al., 1993). 

25 Recombinant HIV and avian sarcoma virus (ASV) integrases have been shown to behave 
as mixtures of monomers, dimers and tetramers (Coleman et al., 1999;.Deprez et al, 
2000; Jenkins et al., 1996). The presence of octamers and larger complexes has been 
suggested by some studies (Lee et al., 1997; Leh et al., 2000). Virion-associated HIV IN 
was shown to be in an oligomeric form, whereby dimers and higher order multimers 

30 appeared to be stabilized by disulfides, although the complexes were not studied under 
native conditions (Petit et al., 1999). Earlier, IN derived from avian myoblastosis virus 
(AMV) particles was demonstrated to be multimeric, behaving at least as a dimer 
(Grandgenett et al., 1 978). 
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The 3'-procesi 



and strand-transfer reactions can be 



>duced in vitro using 



recombinant enzyme preparations and DNA substrates that mimic LTR ends. In vivo 
retroviral integration is preceded by the assembly of a stable and compact preintegration 
complex (PIC), that contains a 9.5 kb linear DNA copy of the viral genome associated 
5 with viral and cellular proteins (Brown et al., 1987; Bukrinsky et al., 1993; Ellison et al., 
1990; Farnet and Haseltine, 1991; Miller et al., 1997). Several cellular proteins have been 
suggested to play auxiliary roles during retroviral integration. Thus, the barrier-to- 
autointegration factor (B AF) has been reported to protect Moloney murine leukemia virus 
(MoMLV) PICs against suicidal self-integration (Lee and Craigie, 1998). Another cellular 

10 protein, HMG-I(Y) was found in HIV PICs and appeared to be essential for their 
integration activity in vitro (Farnet and Bushman, 1997; Miller et al., 1997). Conversely, 
BAF could substitute for HMG-IQO at least in vitro, partially restoring integration 
activity of salt-denatured HIV-1 PICs (Chen and Engelman, 1998). Yet, BAF remains to 
be shown to co-fractionate with retroviral PICs. Both BAF and HMG-IQO are small 

15 DNA-binding proteins able to bridge and deform DNA molecules (Reeves and 
Beckerbauer, 2001; Zheng et al., 2000). They are thought to play structural roles within 
retroviral PICs, possibly juxtaposing both LTRs. Similarly, Mu phage transposase as well 
as enzymatically-unrelated X phage integrase require the DNA-bending host proteins IHF 
and/or HU to form committed synaptic complexes (Friedman, 1992; Mizuuchi, 1992). 

20 IHF and HU appear to substitute for each other in promoting formation of the active X 
integrase-DNA complexes in vitro (Segall et al., 1994). Another potential co-factor for 
HIV integration, the integrase interactor 1 (Inil or hSNF5/BAF47), was originally 
discovered in a yeast two-hybrid screen for human proteins interacting with HTV-1 IN 
(Kalpana et al., 1994). Ml is a subunit of the 2 MDa SWI/SNF chromatin-remodeling 

25 complex (Wang et al., 1996). It has been proposed that Inil plays a role during retroviral 
replication by directing the PICs to open chromatin regions (Kalpana et al., 1994) or by 
modulating expression of the integrated provirus (Turelli et al., 2001). Recent studies 
demonstrated that GFP-tagged Inil was exported from the nuclei of infected cells and co- 
localized with incoming sub-yiral particles (Turelli et al., 2001). Inil has also been 

30 reported to enhance the release of infectious HIV particles; detectable amounts of Inil 
have been shown encapsulated within virions (Yung et al., 2001). 
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ive therapies for the treatment and prev^fei of retroviral virus 



In order to develop^ 
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infectious diseases, such as AIDS, elucidation of the mechanisms by which it replicates in 
the human host is essential. Therefore, it is apparent that there still exists a need in the art 
for the identification and characterization of the cofactors which participate in the 
retroviral integration process. 

The cloning and characterization of the protein Lens Epithelial Cell derived Growth 
Factor (LEDGF) is described in PCT/US98/09801. The patent application describes the 
stimulating activity of LEDGF on the proliferation of different cell types, particularly 
those of epithelial character, such as lens epithelial cells. In general, methods for treating 
cancer, inducing wound healing, inducing or inhibiting cell-death, cell differentiation and 
cell proliferation are provided. Also antibodies against LEDGF are described. 
The LEDGF protein has been described as the positive transcription cofactor PC4- 
interacting protein (Ge et al., 1998). Two alternatively spliced cDNA clones were isolated 
from a HeLa library coding for two proteins p52 (333 amino acids, as predicted from the 
cDNA) and p75 (530 aminoacids) sharing identical 325 amino terminal residues. The 
transcripts coding for p75 and p52 were detected in different cell types and tissues, p52 
being most abundant in testis and p75 in thymus. Independently, a cDNA clone coding 
for a protein identical to p75 has been isolated from a lens epithelium cell library (Singh 
et al, 1999). Overexpression of the protein stimulated survival of diverse primary cells 
and cell lines and enhanced their resistance to oxidative and hyperthermic stress (hence 
LEDGF, "lens epithelium derived growth factor"). The same protein has also been 
identified as the DFS70 autoantigen, antibodies to which were found in some cases of 
atopic dermatitis, asthma and interstitial cystitis (Ochs et al., 2000). One case of acute 
myeloid leukemia with a chromosomal translocation resulting in a fusion of the NUP98 
and LEDGF genes has been described (Ahuja et al., 2000). 

SUMMARY OF THE INVENTION 

The present invention demonstrates that in the nuclear extracts of human cells stably 
expressing the HIV integrase protein from a synthetic gene, IN exists as part of a large 
distinct complex with apparent Stokes radius of 61 A, which dissociates upon dilution 
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41 A core is a tetramer of IN, whereas 61 A molecules are composed of IN tetramers 
associated with a cellular protein with an apparent molecular weight of 76 kDa. In the 
present invention, this integrase interacting protein (Inip76) has been identified as being 
5 identical to LEDGF/DFS70/p75, a protein implicated in regulation of gene expression and 
cellular stress-response. HIV-1 IN and Inip76 co-localized in the nuclei of human cells 
stably expressing IN. Furthermore, the present invention demonstrates that recombinant 
Inip76 strongly promotes strand-transfer activity of HTV-1 IN in vitro. The present 
invention thus provides that the minimal IN molecule in human cells is a tetramer and that 
10 Inip76 plays an important role in retroviral integration. 

Therefore the present invention relates to proteins of the hepatoma-derived growth factor 
family, more in particular Inip76, the polynucleotides expressing them and molecules 
binding or interacting with the proteins or the polynucleotides. The present invention 

15 relates to isolated and purified polynucleotide encoding the Inip76 protein, the complex of 
Inip76 with the integrase and their use. The present invention also relates to the Inip76 
protein and its use. Furthermore, the present invention relates to molecules which 
comprise a region specifically binding to the Inip76 protein of the hepatoma-derived 
growth factor family or nucleic acids encoding said proteins and which suppress or 

20 prevent retroviral replication and to their use. The present invention also relates to 
antibodies to Inip76 and to antisense oligonucleotides, antigene therapy, RNA 
interference and ribozymes for inhibiting the expression of Inip76 and their use. 
The present invention also relates to a protein complex comprising a retroviral integrase 
and Inip76, each of them natural or recombinant. The invention also relates to a method 

25 of stimulating retroviral integration using Inip76 or functional analogues thereof and to 
the use of Inip76 as a cellular cofactor of retroviral integration. 

Furthermore, the present invention relates to the HIV-integrase, characterised in that said 
integrase is in its tetrameric form and to its use. 

30 The present invention relates to the use of an isolated and purified polynucleotide 
encoding the Inip76 protein or an intermediate or a fragment of said protein, an allelic 
variant, a homologue, a portion or a mutation thereof, for the prevention, treatment or 
diagnosis of viral infections, more in particular a retroviral infection, yet more 
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particular embodiment, the present invention also relates to the use of said 
polynucleotides for the manufacture of a medicament for the prevention and/or treatment 
of viral infections or for the manufacture of a diagnostic tool in the viral field, more 
particularly the HIV-field and to the use of said polynucleotides for the screening of 
molecules for their anti-viral activity. Another embodiment of the present inventin relates 
to a polynucleotide encoding the Inip76 protein or an intermediate or a fragment of said 
protein, an allelic variant, a homologue, a portion or a mutation thereof, further 
comprising a polynucleotide which codes for a least a portion of the HIV integrase, more 
in particular the combination of both polynucleotides being arranged in such a way that a 
fusion protein results after expression. 

The present invention also relates to vectors comprising at least a polynucleotide 
encoding the Inip76 protein or an intermediate or a fragment of said protein, an allelic 
variant, a homologue, a portion or a mutation thereof and to the use of said vector for the 
manufacture of a medicament for the prevention and/or treatment of viral infections or for 
the manufacture of a diagnostic tool in the viral field, more particularly the HIV-field and 
to the use of said vector for the screening of molecules for their anti-viral activity. 

The present invention furthermore relates to the use of the proteins of the hepatoma- 
derived growth factor family, more in particular the Inip76 protein, a homologue, a 
variant, a mutated form or a fragment thereof, for the manufacture of a medicament for 
the prevention and/or treatment of a viral infection, more in particular a lentiviral 
infection, yet more particularly a HIV-infection. In a particular embodiment the present 
invention relates to the use of said Inip76 protein, a homologue, a variant, a mutated form 
or a fragment thereof, for the manufacture of an HIV-related diagnostic tool, for the 
screening of molecules for their, anti-viral activity or for the crystallisation of a lentiviral- 
integrase. 

The present invention furthermore relates to the use of a molecule which comprises a 
region specifically binding to a protein of the hepatoma-derived growth factor family, 
more particularly Inip76 or nucleic acids encoding said protein of the hepatoma-derived 
growth factor family, for the manufacture of a medicament for the prevention and/or 
treatment of a viral infection. In a particular embodiment, said molecules are 
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characterisecj^fciat they prevent or suppress the protein o^hepatoma-derived growth 
factor family ofinteracting with retroviral integrase. In particular embodiments, these 
molecules can be selected from the group comprising an antibody or any fragment 
thereof; a small molecule specifically binding to M P 76 or to nucleic acids encoding said 
5 Inip76; a ribozyme capable of cleaving nucleic acids encoding Inip76; antigene 
compounds and anti-sense nucleic acids or analogues thereof, capable of inhibiting the 
expression of Inip76 by interacting with nucleic acids encoding Ini P 76. It is therefore an 
object of the present invention to provide antibodies to the Integrase-interacting protein 
(INIP76) and its subunits, and methods for their preparation, including recombinant 

10 means. In another particular embodiment the present invention relates to small interfering 
RNAs and vectors encoding said siRNAs specific for Inip76 and to their use for the 
manufacture of a medicament for the prevention and/or treatment of viral infections.The 
present invention thus relates in general to the use of RNA interference directed against 
Inip76 for the prevention and treatment of lentiviral infections. In another particular 

15 embodiment, the present invention relates to the use of small molecules capable of 
modulating the interaction of a protein of the hepatoma-derived growth factor family with 
integrase, for the manufacture of a medicament for the prevention and/or treatment of a 
viral infection. 



20 It is an object of the present invention to provide a method and associated assay system 
for screening substances such as drugs, agents and the like, for their activity in combating 
the adverse effects of the Integrase-interacting proteins (INIPs) and/or its subunits in 
retrovirus infected mammals. The present invention also relates to a method to identify 
molecules which comprise a region specifically binding to a protein of the hepatoma- 

25 derived growth factor family, more in particular Inip76 or nucleic acids encoding said 
protein of the hepatoma-derived growth factor family, comprising the steps of exposing 
said protein or nucleic acids encoding said protein to at least one molecule whose ability 
to suppress said protein of interacting with a retroviral integrase protein is sought to be 
determined, followed by determining the interaction (i.e. binding, hybriddisation) of said 
30 molecule(s) to said Inip76 protein, to the binding places on teh integrase of said Inip76 
protein or to nucleic acids encoding said protein and finally monitoring the prevention or 
suppression of retroviral replication or integration by at least one of said molecules. A 
particular embodiment of the present invention relates to the above described method 
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where the ability 




ie molecules to suppress the promoting 




tvity on the strand 



transfer activity of HIV is determined. 

On aspect of present inventions is to improve integration of a transgene in a mammalian 
host cell genome by a method to introduce integrase-interacting proteins (INIPs) (gene 
therapy), preferably Inip76, or an active fragment thereof to increase the amount of 
integrase-interacting proteins (INIPs), preferably Inip76, or an active fragment thereof in 
said mammalian cell by methods of transfection known to the man skilled in the art (e.g. 
liposome delivery). In another embodiment, this method may be combined by with 
transfecting host cell with heterologous or exogenous DNA encoding integrase or an 
active fragment thereof. 

In a further embodiment, the present invention relates to methods of isolating and 
characterizing a novel Integrase-interacting protein (IMP). 

In a particular embodiment, the present invention relates to integrase-interacting proteins, 
active fragments thereof, agonists thereof, mimics thereof, and combinations thereof, said 
protein having the following characteristics: 

a) it binds with the retroviral Integrase and stimulates the activity of retroviral 
Integrase and/or stimulates the DNA binding activity of retroviral Integrase and/or 
stimulates the strand-transfer activity of retroviral Integrase 

b) it possesses a molecular weight between 66 kDa to 86 kDa or an apparent 
molecular weight of approximately 76 kDa as determined by SDS-polyacrylamide gel 
electrophoresis. 

Accordingly, it is an object of the present invention to provide an Integrase-interacting 
protein (INIP76), which possesses an apparent molecular weight of approximately 76 kDa 
as determined by SDS-polyacrylamide gel electrophoresis and which exhibits certain 
characteristics and activities associated with the interaction of the Integrase-interacting 
protein (INIP) with Integrase and their combined retroviral integration activity. 
It is another object of the present invention to provide a method for the prevention and/or 
treatment of mammals from a retorviral infection to control the amount or activity of the 
Integrase-interacting proteins (INIPs) or subunits thereof. 
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The present in^Jfcon also relates to a composition conij^^g a) molecule which 
comprises a region specifically interacting with a protein of the hepatoma-derived growth 
factor family or nucleic acids encoding said protein of the hepatoma-derived growth 
factor family, and b) one or more compounds effective in the treatment or prevention of 
viral infections. 



It is an object of the present invention to provide a method for the treatment of mammals 
to control the amount or activity of the Integrase-interacting proteins (INIPs) or subunits 
thereof, so as to treat or avert the adverse consequences of invasive, spontaneous or 

1 0 idiopathic pathological states resultant from HIV infection. 

It is a still further object of the present invention to provide pharmaceutical compositions 
for use in therapeutic methods which comprise or are based upon the Integrase-interacting 
protein (INIP76), its subunits, their binding partner(s), or upon agents or drugs that 
control the production, or that antagonize the activities of the Integrase-interacting protein 

15 (INIP76). 



20 



Other objects and advantages will become apparent to those skilled in the art from a 
review of the description which proceeds with reference to the following illustrative 
drawings. 



BRIEF DESCRIPTION OF THE FIGURES 



Figure 1. Extraction of IN from nuclei of 293T-IN s ala cells. 

25 (A) 293T-IN s ala cells were lysed in 100CSK buffer in the presence of 4 |ug/ml digitonin 
or 0.5% NP40 on ice for 10 minutes. The supernatant (S) and nuclear pellet (P) fractions 
were recovered and analyzed by western blotting with an anti-IN antibody. The first lane 
contains the total cell extract. 

(B) 293T-IN s ala cells were lysed in 100CSK buffer supplemented with 0.5% NP40 on ice 
30 for 10 minutes, and the extracted nuclei were re-suspended in CSK buffer containing 0 - 
500 mM NaCl. After centrifugation, supernatants (S) and nuclear pellets (P) were 
analyzed by western blotting with an anti-IN antibody. The total nuclear protein was 
loaded in the first lane (nuc). 
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(C) NP40-permeal 
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incubated in 100CSK buffer with (+DNaseI) or without (-DNasel) DNAsel 
(250 units/ml) at 25 °C for 10 (10') or 30 (30') minutes. Following centrifugation, the 
superaatants (S) and pellets (P) were separated in an 1 1% SDS PAGE gel, the upper part 
of which was used for the immunoblot to detect MCM3 (91 kDa) and the lower part to 
detect IN (32 kDa). The total cytoplasmic and nuclear protein fractions were loaded in the 
first (cyt) and the second (nuc) lanes, respectively. 

(D) NP40-permeabilized nuclei from 293T-IN s ala cells were incubated in 100CSK buffer 
with or without DNasel (250 units/ml) for 10 minutes, pelleted by centrifugation and 
resuspended in ice-cold hypotonic buffer (2 mM EDTA, 2 mM Hepes [pH7.5]). After 
centrifugation, supernatants (S) and pellets (P) were analyzed by western blotting with 
anti-IN antibodies. The first lane (nuc) contains total nuclear protein. 

Figure 2. Cross-linking of the IN and BV f complexes with DTSSP. 

(A) The nuclear extract from 293T-IN s ala cells was prepared in the 400CSK buffer, was 
cross-linked with DTSSP and separated in a non-reducing 4-12% SDS PAGE gel. The 
IN-containing cross-linking adducts were detected by western blotting with a polyclonal 
anti-IN antibody. Prior to cross-linking, the extract was adjusted to 100 ^g/ml (lanes 1-5), 
20 |ng/ml (lanes 6-9) or 4 ng/ml (lanes 10-13) of total protein. The concentration of 
DTSSP was 0.1 mM (lanes 3, 7, 11), 0.5 mM (lanes 1, 4, 8, 12) or 2.0 mM (lanes 5, 9, 
13). No cross-linker was added to the samples in lanes 2, 6 and 10. The sample in lane 1 
was cross-linked in the presence of 0.2% SDS. 

(B) Cross-linking of the FLAG-tagged IN complexes was done in similar conditions as 
in (A). Only lanes containing samples cross-linked with 2 mM DTSSP are shown; the 
sample in lane 1 was cross-linked in the presence of 0.2% SDS. The p300 cl , pl50 cl , p60 cl , 
the IN monomer bands and the positions of the MW markers are indicated. The «470 kDa 
mark corresponds to the band of the catalytic subunit of DNA PK (MW 469 kDa), which 
was detected in a separate lane with a monoclonal anti-DNA PKcs antibody. 

Figure 3. Determination of sizes and molecular weights of the IN complexes. 

(A) Chromatography of nuclear extracts from 293T-ESf s ala cells was carried out on a 
calibrated Superdex 200 column. Prior to chromatography, the extract was adjusted to 
550 fag/ml or 30 jag/ml of total protein; the collected fractions (1-26) were tested for the 
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presence of IN^^estern blotting. The elution volumes (vj^l the respective partition 
coefficients (Kav) for the observed IN peaks are indicated. 

(B, C) Determination of the Stokes radii and approximate molecular weights of the IN 
complexes from the experimental Ka V values. The partition coefficients for the standard 
proteins were determined in the same conditions (thyr., thyroglobulin, Ka V = 0.039; 
ferritin, = 0.17; catalase, = 0.26; aldolase, Ka V = 0.28; BSA, bovine serum 
albumin, Ka V = 0.39; chymotr., chymotrypsinogen A, Ka V = 0.59). 



Figure 4. Immunoprecipitation of the FLAG-tagged IN from diluted nuclear 
10 extracts, 

(A) Nuclear extracts prepared from metabolically labeled 293T-IN*ala and 293T- 
IN*alaFLAG cells were diluted to 200 jig/ml of total protein and immunoprecipitated with 
the anti-FLAG M2 antibody and protein G agarose beads for 16 hours at 4 °C. The protein 
was eluted by boiling in SDS PAGE sample buffer and separated in a 4-20% SDS PAGE 

15 gel. Radioautograph of the gel is presented. 

(B) FLAG-tagged IN was immunoprecipitated from a diluted nuclear extract of non- 
labeled 293T-IN s alaFLAG cells overnight. The protein was eluted with FLAG peptide in 
400CSK buffer and cross-linked with DTSSP. The reaction conditions are similar to those 
in Figure 2. 

20 (C) The protein immunoprecipitated and eluted as in (B) was subjected to gel filtration on 
a Superdex 200 column. The fractions collected (1-26) were analyzed by western blotting 
with anti-IN antibodies. The R s value corresponding to the observed peak was determined 
as in Figure 3B. 



25 Figure 5. Co-immunoprecipitation of FLAG-tagged IN with Inip76 from non-diluted 
nuclear extracts. 

(A) Nuclear extracts from 293T-IN s alaFLAG and 293T cells (700 ^ig/ml of total protein) 
were incubated with anti-FLAG M2 antibody and protein G agarose beads for 4 hours. 
The beads were washed as described in materials and methods and bound proteins were 
30 eluted with FLAG peptide in 400CSK buffer. The eluted proteins were concentrated by 
precipitation with TCA, re-dissolved in SDS sample buffer, separated in 4-20% 
denaturing PAGE gels and visualized by silver staining. Lane 1: immunoprecipitate of a 
nuclear extract from 60xl0 6 293T-IN s alaFLAG cells. Lanes 2 and 3: 
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immunoprecipitati^^as done in parallel with nuclear extract^fen 293T and 293T- 
BfalaFLAG cells. The bands of INf, Inip76, the heavy and the light chains of the anti- 
FLAG M2 IgGl antibody as well as aprotinin (protease inhibitor present in 400CSK 
buffer) are indicated. Two sets of molecular weight markers were used in both gels to 
determine the apparent molecular weight of Inip76. Positions of the molecular weight 
markers are shown. 

(B) Nuclear extract from 20 xlO 6 293T-IN s alaFLAG cells («700 |ig/ml total protein) was 
immunoprecipitated for either 4.5 hours (left lane) or 18 hours (right lane). The protein 
was then eluted and analyzed as in (A). 



Figure 6. Inip76 is identical to LEDGF/DFS70/p75, appears to be a component of the 
61 A complex and stimulates IN activity in vitro. 

(A) The Coomassie blue-stained PVDF membrane used for ammo-terminal 
microsequencing of Inip76. The INf-Inip76 complex was eluted from protein G agarose- 

15 immobilized anti-FLAG M2 antibody with FLAG peptide, separated in reducing 4-20% 
SDS PAGE gel and transferred onto the PVDF membrane (lane 1). The proteins left on 
the beads after incubation with FLAG peptide were eluted with SDS sample buffer (lane 
2). The bands corresponding to Inip76, INf, the heavy (H) and light (L) chains of the anti- 
FLAG M2 antibody, aprotinin and the MW markers are indicated. 

20 (B) Deconvoluted and sequence-converted MS/MS spectrum of a doubly charged peptide 
ion (m/z 982.57) obtained from the in-gel tryptic digest of Inip76 corresponding to the 
LEDGF peptide N425-K442. The observed b- and y-dominant fragment ions are 
indicated (Biemann, 1990). 

(C) Co-immunoprecipitation of LEDGF and FLAG-tagged IN from a nuclear extract of 
25 293T-IN s alaFLAG cells. Immunoprecipitation was carried with anti-FLAG (lanes 2 and 

2'), anti-LEDGF (lanes 3 and 3') or no antibody (lanes 1 and 1'). After 4 hours 
incubation, protein G agarose beads with precipitated protein complexes were washed 
with three changes of 400CSK buffer and re-suspended in reducing SDS PAGE sample 
buffer, followed by western blotting to detect DSf f and Inip76/LEDGF (lanes l'-3'). The 
30 lanes 1-3 contain the immunoprecipitation supernatants. 

(D) Elution of the 61 A IN complex from a gel filtration column is shifted after pre- 
incubation with anti-LEDGF antibody. A nuclear extract of 293T-IN s alaFLAG cells was 
preincubated with 3 jag/ml anti-HA (control IgGl) or anti-LEDGF antibody and separated 
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by chromatogr; 




on a Superdex 200 column. INf was 




:ed in the fractions by 



western blotting. The void volume of the column was 8.3 ml, approximately 
corresponding to fraction 2. 

(E) Cross-linking of the Inip76-INf complex with DTSSP. The Inip76/INf complex was 
5 immunoprecipitated from a nuclear extract prepared from 293T-IN s alaFLAG cells with 

anti-FLAG M2 antibody and protein G agarose for 4.5 hours. The protein was eluted with 
FLAG peptide and incubated with 2mM DTSSP in the presence (lanes 1 and V) or 
absence (lanes 2 and 2 9 ) of 0.2% SDS. The cross-linked samples were then separated in a 
non-reducing 4-12% SDS PAGE gel and immunoblotted with polyclonal anti-IN (left 
10 blot) or monoclonal anti-LEDGF (right blot) antibodies. The positions of INf and Inip76 
as well as the cross-Unking adducts pl50 cl , p300 cl , pHMWi cl and pHMW2 Cl are indicated. 
Anti-FLAG M2 IgGl present in the sample is detected on the anti-LEDGF western blot. 

(F) Recombinant Inip76 enhances HIV-1 IN strand transfer activity in vitro. Mini-HIV 
DNA was pre-incubated with HIV-1 IN for 7 minutes at room temperature. Next, 

15 0-0.8 |xM His6-tagged Inip76 was added to the reactions that were further incubated at 
37°C for 90 minutes. The concentrations of IN and Inip76 used in the reactions are 
indicated. The reactions contained 125 ng mini-HIV DNA, 110 mM NaCl, 20 mM Hepes 
[pH 7.5], 5 mM DTT and 5 juM ZnCh in a final volume of 20 |xl. The reactions were 
stopped by addition of 0.5% SDS and 25 mM EDTA and the samples were digested with 

20 0.25 mg/ml proteinase K at 37 °C for 30 minutes to completely disrupt protein-DNA 
complexes. DNA was then precipitated with ethanol, re-dissolved in Tris-EDTA and 
analyzed by electrophoresis in an 0.8% agarose gel. The positions of the DNA molecular 
weight markers (23.1, 9.4, 6.6, 4.4 and 2.3 kb) are indicated. The gel was stained with 
SybrGold (Molecular Probes). 



Figure 7. Co-localization of FLAG-tagged IN and Inip76/LEDGF in 293T- 
EM 5 alaFLAG cells. 

(A) Confocal laser scanning micrographs of a fixed and permeabilized cell fluorescently 
stained with a combination of monoclonal anti-LEDGF plus Alexa-488 conjugated anti- 
30 mouse antibodies to detect Inip76 (green, Inip76) and rabbit polyclonal anti-FLAG plus 
Alexa-555 conjugated anti-rabbit antibodies (red, IN) to localize FLAG-tagged IN. DNA 
was stained with To-Pro3 iodide (shown as white). The two-color merged image 
(IN+Inip76) was produced by overlaying the IN and Inip76 images. 



25 
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(B) Both IN f and 




r 6 are associated with condensed chrom! 




tes during mitosis. 



Immunofluorescent staining was performed as described in (A). 

(C) DNA PKcs and INf display no significant co-localization. INf (red, IN) was detected 
as in (A); DNA PKcs (red, PKcs) was localized with monoclonal anti-DNA PKcs 
antibody plus Alexa-555 conjugated anti-mouse antibody. The two color IN-f-PKcs image 
is an overlay of the IN and PKcs images. 

Figure. 8. Effect of RNA interference-mediated knock-down of Inip76/LEDGF 
expression in 293T-IN s alaFLAG cells on HIV IN distribution* 

Inip76 (A, B) and IN (C) were detected by indirect immunofluorescence in non- 
transfected cells (A), cells transfected with 76A siRNA (B, C). Immunofluorescence, 
chromosomal DNA staining and detection was done as described in Example VI. 

Figure 9. Purification of the Inip76-IN complex from bacteria co-expressing the two 
proteins. 

(A) Elution of the IN-Inip76 complex from NiNTA Sepharose; 7 fractions collected were 
analyzed by 11% SDS PAGE and Coomassie staining. (B) A lysate of induced 
PC2LEDGF cells lacking His 6 -tagged IN was incubated with NiNTA Sepharose and the 
bound proteins were eluted with 200 mM imidazol. The band corresponding to Inip76 is 
not present on this gel, confirming that free Inip76 is not associating with NiNTA 
Sepharose. (C) Further purification of the IN-Inip76 complex by cation exchange 
chromatography. The fractions 3, 4, 5 and 6 eluted from NiNTA (see panel A). They were 
pooled and loaded onto a 1 ml HiTrap SP Sepharose column. The complex was eluted 
with linear gradient of NaCl concentration. The non-bound (flow-through) fraction as 
well as the fractions #10 through #28 were analyzed by 11 % SDS PAGE and Coomassie 
staining. Positions of the molecular weight markers (kDa) are indicated. 

Figure 10. Concentration of the enzymatically-active IN-Inip76 complex. 

(A) The complex purified by chromatography on NiNTA and SP Sepharose was 
concentrated using Centricon-30. Equal volumes of the original non-concentrated sample 
(lane 1), the intermediate samples (lanes 2, 3) and the final concentrated sample (lane 4) 
were separated by 11 % SDS PAGE. (B) Activity of the purified and concentrated 
recombinant IN-Inip76 complex was tested in the mini-HIV reaction in conditions 
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described in tl^^ample VII. Positions of the DNA mol^fer weight markers (kb) 
separated in a separate lane of the gel are indicated. 



Figure 11. Recombinant LEDGF/p75 forms a stable complex with HIV-1 IN and 
5 HIV-2 IN, but not with HTLV-2 IN. 

The pull-down assay was performed as described in example 12. The specific buffer 
conditions, i.e. NaCl concentration are indicated. MgCh was present in all reactions. 
Recombinant LEDGF/p75 was incubated with His6-tagged HIV-1 IN (lanes 3 and 5 on 
both gels), HIV-2 IN (lanes 4 and 6 on the upper gel) or HTLV-2 IN (lanes 4 and 6 on the 

10 lower gel). The complexes were recovered on a Ni 2+ -chelating resin (lanes 2-6). During 
the test some of the proteins aggregated, as shown in lanes 8-12. In the control samples 
(lane 2), recombinant IN was omitted. Lanes 1 and 7 reflect protein input in the reactions; 
BSA, LEDGF/p75 en His6~IN were loaded in the same amounts as were present in the 
binding reactions. The respective positions of the proteins are indicated on the left side of 

1 5 the gel; the gel was stained using Coomassie R250. 



Figure 12 

A. Stable knock-down of LEDGF/p75 in transduced MOLT-4 - derived cell lines 

Extracts of polyclonal MOLT-4-derived cell lines were separated using sodium dodecyl 
20 sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and blotted onto a PVDF 

membrane. LEDGF/p75 was detected using monoclonal antibodies. Extracts of MOLT-4 

(lane 1) and MOLT-4 transduced with the following retroviral vectors: MSCV/U6-cont 

(lane 2) and MSCVAJ6-p75 ( lane 3) are shown. 

B Delay of HIV-1 replication in cell lines with silenced LEDGF/p75. 
25 The MOLT-4-derived cell line MSCV/U6-cont was infected with HIV-1 (ni B ) at different 

moi: — •— (2.10" 4 moi), — T — (2.10* 5 moi), - - (2.10' 6 moi). In parallel the 

MOLT-4-derived cell line MSCV/U6-p75 was infected: o •••• (2.10^ moi), V- 

• - - (2.10" 5 moi), □ (2.10" 6 moi). At different time points after infection p24 

was measured in the supernatant. 

30 
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DETAILED DESCRIPTION OF THE INVENTION 



In accordance with the present invention there may be employed conventional molecular 
5 biology, microbiology, and recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e.g., Sambrook et al, "Molecular 
Cloning: A Laboratory Manual" (1989); "Current Protocols in Molecular Biology" 
Volumes MH [Ausubel, R. M., ed. (1994)]; "Cell Biology: A Laboratory Handbook" 
Volumes WE [J. E. Celis, ed. (1994))]; "Current Protocols in Immunology" Volumes I-m 
10 [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" (M. J. Gait ed. 1984); "Nucleic 
Acid Hybridization" [B. D. Hames & S. J. Higgins eds. (1985)]; "Integration And 
Translation"[R D. Hames & S. J. Higgins, eds. (1984)]; "Animal Cell Culture" [R. I. 
Freshney, ed. (1986)]; "hnmnobilized Cells And Enzymes" [IRL Press, (1986)]; B. 
Perbal, "A Practical Guide To Molecular Cloning" (1984). 

15 

Definitions 

If appearing herein, the following terms shall have the definitions set out below. 

20 The terms "Integration-interacting protein, "INIP" and any variants not specifically listed, 
may be used herein interchangeably, and as used throughout the present application and 
claims refer to protein material including single or multiple proteins, and extends to those 
proteins having the amino acid sequence substantially homologous to the sequence of 
LEDGF and the profile of activities of INIP76 as set forth herein and in the Claims. 

25 Accordingly, proteins displaying substantially equivalent or altered activity are likewise 
contemplated. These modifications may be deliberate, for example, such as modifications 
obtained through site-directed mutagenesis, or may be accidental, such as those obtained 
through mutations in hosts that are producers of the complex or its named subunits. Also, 
the terms "Integration-interacting protein", "INIP" and "INIP(s)" are intended to include 

30 within their scope proteins specifically recited herein as well as all substantially 
homologous analogs and allelic variations. 
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A "replicon" genetic element (e.g., plasmid, chromosc^^ virus) that functions as 

an autonomous unit of DNA replication in vivo; i.e., capable of replication under its own 
control. 

A "vector" is a replicon, such as plasmid, phage or cosmid, to which another DNA 
5 segment may be attached so as to bring about the replication of the attached segment. 

A "DNA molecule" refers to the polymeric form of deoxyribonucleotides (adenine, 
guanine, thymine, or cytosine) in its either single stranded form, or a double-stranded 
helix. This term refers only to the primary and secondary structure of the molecule, and 

10 does not limit it to any particular tertiary forms. Thus, this term includes double-stranded 
DNA found, inter alia, in linear DNA molecules (e.g., restriction fragments), viruses, 
plasmids, and chromosomes. In discussing the structure of particular double-stranded 
DNA molecules, sequences may be described herein according to the normal convention 
of giving only the sequence in the 5* to 3' direction along the nontranscribed strand of 

15 DNA (i.e., the strand having a sequence homologous to the mRNA). 

A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed and 
translated into a polypeptide in vivo when placed under the control of appropriate 
regulatory sequences. The boundaries of the coding sequence are determined by a start 
20 codon at the 5 f (amino) terminus and a translation stop codon at the 3 1 (carboxyl) 
terminus. A coding sequence can include, but is not limited to, prokaryotic sequences, 
cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., 
mammalian) DNA, and even synthetic DNA sequences. A polyadenylation signal and 
integration termination sequence will usually be located 3' to the coding sequence. 

25 

A "promoter sequence" is a DNA regulatory region capable of binding RNA polymerase 
in a cell and initiating transcription of a downstream (3' direction) coding sequence. For 
purposes of defining the present invention, the promoter sequence is bounded at its 3 1 
terminus by the transcription initiation site and extends upstream (5 1 direction) to include 
30 the minimum number of bases or elements necessary to initiate transcription at levels 
detectable above background. Within the promoter sequence will be found a transcription 
initiation site (conveniently defined by mapping with nuclease SI), as well as protein 
binding domains (consensus sequences) responsible for the binding of RNA polymerase. 

17 



10 



WO 2004/029246 PCT/BE2003/000164 

Eukaryotic promo^^will often, but not always, contain ,K L^^ boxes and "CAT" 
boxes. Prokaryotic promoters contain Shine-Dalgamo sequences in addition to the -10 
and -35 consensus sequences. 

The term "oligonucleotide," as used herein in referring to the probe of the present 
invention, is defined as a molecule comprised of two or more (deoxy)ribonucleotides, 
preferably more than three. Its exact size will depend upon many factors which, in turn, 
depend upon the ultimate function and use of the oligonucleotide. A "polynucleotide" as 
used herein comprises at least 20 (deoxy)ribonucleotides. 



The term "primer" as used herein refers to an oligonucleotide, whether occurring naturally 
as in a purified restriction digest or produced synthetically, which is capable of acting as a 
point of initiation of synthesis when placed under conditions in which synthesis of a 
primer extension product, which is complementary to a nucleic acid strand, is induced, 

15 i.e., in the presence of nucleotides and an inducing agent such as a DNA polymerase and 
at a suitable temperature and pH. The primer may be either single-stranded or double- 
stranded and must be sufficiently long to prime the synthesis of the desired extension 
product in the presence of the inducing agent. The exact length of the primer will depend 
upon many factors, including temperature, source of primer and use of the method. For 

20 example, for diagnostic applications, depending on the complexity of the target sequence, 
the oligonucleotide primer typically contains 15-25 or more nucleotides, although it may 
contain fewer nucleotides. 



The primers herein are selected to be "substantially" complementary to different strands 
25 of a particular target DNA sequence. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. Therefore, the primer sequence 
need not reflect the exact sequence of the template. For example, a non-complementary 
nucleotide fragment may be attached to the 5 f end of the primer, with the remainder of the 
primer sequence being complementary to the strand. Alternatively, non-complementary 
30 bases or longer sequences can be interspersed into the primer, provided that the primer 
sequence has sufficient complementarity with the sequence of the strand to hybridize 
therewith and thereby form the template for the synthesis of the extension product. 
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A cell has beeir 




been introduced inside the cell. The transforming DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes, yeast, and mammalian cells for example, the transforming DNA may be 
5 maintained on an episomal element such as a plasmid. With respect to eukaryotic cells, a 
stably transformed cell is one in which the transforming DNA has become integrated into 
a chromosome so that it is inherited by daughter cells through chromosome replication. 
This stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or 
clones comprised of a population of daughter ells containing the transforming DNA. A 
10 "clone" is a population of cells derived from a single cell or common ancestor by mitosis. 
A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for many 
generations. 

Two DNA sequences are "substantially homologous" when at least about 75% 
15 (particularly at least about 80%, and most particularly at least about 90 or 95%) of the 
nucleotides match over the defined length of the DNA sequences. Sequences that are 
substantially homologous can be identified by comparing the sequences using standard 
software available in sequence data banks, or in a Southern hybridization experiment 
under, for example, stringent conditions as defined for that particular system. Defining 
20 appropriate hybridization conditions is within the skill of the art. See, e.g., Maniatis et al., 
supra; DNA Cloning, Vols. I & II, supra; Nucleic Acid Hybridization, supra. 
Two amino acid sequences are "substantially homologous" when at least about 70% of 
the amino acid residues (preferably at least about 80%, and most preferably at least about 
90 or 95%) are identical, or represent conservative substitutions. 



A "heterologous" region of the DNA construct is an identifiable segment of DNA within a 
larger DNA molecule that is not found in association with the larger molecule in nature. 
Thus, when the heterologous region encodes a mammalian gene, the gene will usually be 
flanked by DNA that does not flank the mammalian genomic DNA in the genome of the 
30 source organism. Another example of a heterologous coding sequence is a construct 
where the coding sequence itself is not found in nature (e.g., a cDNA where the genomic 
coding sequence contains introns, or synthetic sequences having codons different than the 



25 



WO 2004/029246 PCT/BE2003/000164 

native gene). Alleviations or naturally-occurring mutationa^^nts do not give rise 
to a heterologous region of DNA as defined herein. 

An "antibody 1 * is any immunoglobulin, including antibodies and fragments thereof, that 
5 binds a specific epitope. The term encompasses polyclonal, monoclonal, and chimeric 
antibodies, the last mentioned described in further detail in U.S. Pat. Nos. 4,816,397 and 
4,816,567. 

An "antibody combining site" is that structural portion of an antibody molecule comprised 
of heavy and light chain variable and hypervariable regions that specifically binds 
10 antigen. 

The phrase "antibody molecule" in its various grammatical forms as used herein 
contemplates both an intact immunoglobulin molecule and an immunologically active 
portion of an immunoglobulin molecule. 

Exemplary antibody molecules are intact immunoglobulin molecules, substantially intact 
15 immunoglobulin molecules and those portions of an immunoglobulin molecule that 
contains the paratope, including those portions known in the art as Fab, Fab', F(ab').sub.2 
and F(v), which portions are preferred for use in the therapeutic methods described 
herein. 

Fab and FCab^.sub^ portions of antibody molecules are prepared by the proteolytic 
20 reaction of papain and pepsin, respectively, on substantially intact antibody molecules by 
methods that are well-known. See for example, U.S. Pat. No. 4,342,566 to Theofilopolous 
et al. Fab 1 antibody molecule portions are also well-known and are produced from F(ab') 
portions followed by reduction of the disulfide bonds linking the two heavy chain 
portions as with mercaptoethanol, and followed by alkylation of the resulting protein 
25 mercaptan with a reagent such as iodoacetamide. An antibody containing intact antibody 
molecules is preferred herein. 

The phrase "monoclonal antibody" in its various grammatical forms refers to an antibody 
having only one species of antibody combining site capable of immunoreacting with a 
particular antigen. A monoclonal antibody thus typically displays a single binding affinity 
30 for any antigen with which it immunoreacts. A monoclonal antibody may therefore 
contain an antibody molecule having a plurality of antibody combining sites, each 
immunospecific for a different antigen; e.g., a bispecific (chimeric) monoclonal antibody. 
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The phrase "phtBlceutically acceptable" refers to molecula^feties and compositions 
that are physiologically tolerable and do not typically produce an allergic or similar 
untoward reaction, such as gastric upset, dizziness and the like, when administered to a 
human. 



The phrase "therapeutically effective amount" is used herein to mean an amount sufficient 
to prevent, and preferably reduce by at least about 30 percent, more preferably by at least 
50 percent, most preferably by at least 90 percent, a clinically significant change in the 
activity of the HIV infection. 



A DNA sequence is "operatively linked" to an expression control sequence when the 
expression control sequence controls and regulates the transcription and translation of that 
DNA sequence. The term "operatively linked" includes having an appropriate start signal 
(e.g., ATG) in front of the DNA sequence to be expressed and maintaining the correct 
reading frame to permit expression of the DNA sequence under the control of the 
expression control sequence and production of the desired product encoded by the DNA 
sequence. If a gene that one desires to insert into a recombinant DNA molecule does not 
contain an appropriate start signal, such a start signal can be inserted in front of the gene. 

The term "standard hybridization conditions" refers to salt and temperature conditions 
substantially equivalent to S.times.SSC and 65. degree. C. for both hybridization and 
wash. However, one skilled in the art will appreciate that such "standard hybridization 
conditions" are dependent on particular conditions including the concentration of sodium 
and magnesium in the buffer, nucleotide sequence length and concentration, percent 
mismatch, percent forrnamide, and the like. Also important in the determination of 
"standard hybridization conditions" is whether the two sequences hybridizing are RNA-- 
RNA, DNA-DNA or RNA-DNA. Such standard hybridization conditions are easily 
determined by one skilled in the art according to well known formulae, wherein 
hybridization is typically 10-20.degree. C. below the predicted or determined T.sub.m 
with washes of higher stringency, if desired. 
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As used herein, '^^means picogram, "ng" means nanogram^^" or ".mu.g" mean 
microgram, "mg" means milligram, "ul" or .mu.l" mean microliter, "ml" means milliliter, 
"1" means liter. 



Description 



Using a synthetic gene, efficient expression of HIV-1 IN in human cells had been 
achieved in the past (Cherepanov et al., 2000). In the present invention, HIV-1 IN protein 

10 complexes present in nuclear extracts from cells stably expressing this viral protein have 
been characterized. A HIV integrase-interacting protein that forms a distinct complex 
with IN in human cells and an insight into the oligomeric state of intracellular HIV IN, 
suggesting that the minimal cellular IN complex is a tetramer is provided in the present 
invention. It is an object of present invention to prevent or inhibit the integration of the 

15 HIV genome into the host cell chromosomes by inhibiting the interaction between the 
integrase interacting protein and integrase. 

The present invention describes the isolation, purification, identification and recombinant 
production of integrase-interacting proteins (INIPs), and methods of use thereof. These 

20 cofactors participate in the HIV-1 integration process, and can be utilized in therapeutic 
and diagnostic methods relating to the treatment of AIDS. Therefore, in its primary 
aspect, the present invention concerns the identification of Integrase-interacting proteins 
(INIPs). In a particular embodiment, the present invention relates to all members of the 
herein disclosed Integrase-interacting proteins (INIPs), and especially to the Integrase- 

25 inhibiting protein, INIP76. 

One such INIP has been determined to be novel cofactor having the following 
characteristics: 

a) it binds with integrase (IN) and specifically stimulates strand-transfer by IN; and 
30 b) it possesses an apparent molecular weight of approximately 76 kDa as determined by 
SDS-polyacrylamide gel electrophoresis. 
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This cofactor. 




lolynucleotides 



expressing said cofactor 




analogs, derivatives or 



fragments of both can be utilized in the various methods of the present invention, since 
they form a complex with integrase. 

One of the characteristics of the present Integrase-inhibiting protein ENIP76 is its function 
5 as a cofactor that interacts with IN, thereby participating in the increase in the level of 
integration of DNA by the HIV-1 regulatory protein IN. 

The present invention also relates to a recombinant DNA molecule or cloned gene or a 
polynucleotide, or a degenerate variant thereof, which encodes an Integration-interacting 

10 protein (INIP), or a fragment thereof, that possesses a molecular weight of about 76 kD, 
more in particular of maximally 76kDa, more in particular an Integrase-inhibiting protein 
INIP76 (INIP76). The present invention relates in particular to a nucleic acid molecule, in 
particular a recombinant DNA molecule or cloned gene, encoding the Integrase-inhibiting 
protein INIP76 (INEP76), and which has a nucleotide sequence substantially homologous 

15 or is complementary to the DNA sequence of LEDGF, DFS70 or p75 . 

The human and murine DNA sequences of the Integrase-inhibiting protein IN1P76 of the 
present invention or portions thereof, may be prepared as probes to screen for 
complementary sequences and genomic clones in the same or alternate species. The 

20 present invention extends to probes so prepared that may be provided for screening cDNA 
and genomic libraries for the Integrase-inhibiting protein INIP76 (INIP76). For example, 
the probes may be prepared with a variety of known vectors, such as the phage .lambda, 
vector. The present invention also includes the preparation of plasmids including such 
vectors, and the use of the DNA sequences to construct vectors expressing antisense RNA 

25 or ribozymes which would attack the mRNAs of INIP76. Correspondingly, the 
preparation of antisense RNA and ribozymes are included herein. 

In a further embodiment of the invention, the full DNA sequence of the recombinant 
DNA molecule or cloned gene so determined may be operatively linked to an expression 
30 control sequence which may be introduced into an appropriate host. The invention 
accordingly extends to unicellular hosts transformed with the cloned gene or recombinant 
DNA molecule comprising a DNA sequence encoding the present Integrase-interacting 
protein, INIP76 
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According to oth^^eferred features of certain preferred em^^mients of the present 
invention, a recombinant expression system is provided to produce biologically active 
animal or human Integration-interacting protein (INIP). 



5 Also within the scope of the present invention are DNA or RNA sequences 
(polynucleotides) having a sequence substantially homologous to the DNA sequence 
which codes LEDGF which code for a Integration-interacting protein (INIP), with an 
activity substantially that of INIP76. Mutations can be in this DNA sequences such that a 
particular codon is changed to a codon which codes for a different amino acid. Such a 

10 mutation is generally made by making the fewest nucleotide changes possible. A 
substitution mutation of this sort can be made to change an amino acid in the resulting 
protein in a non-conservative manner (i.e., by changing the codon from an amino acid 
belonging to a grouping of amino acids having a particular size or characteristic to an 
amino acid belonging to another grouping) or in a conservative manner (i.e., by changing 

15 the codon from an amino acid belonging to a grouping of amino acids having a particular 
size or characteristic to an amino acid belonging to the same grouping). Such a 
conservative change generally leads to less change in the structure and function of the 
resulting protein, A non-conservative change is more likely to alter the structure, activity 
or function of the resulting protein. The present invention should be considered to include 

20 sequences containing conservative changes which do not significantly alter the activity or 
binding characteristics of the resulting protein. 

The concept of the Integration-interacting protein (INIP) contemplates that specific 
factors exist for correspondingly specific ligands, such as Integration-interacting protein 

25 (INIP) and the like, as described earlier. Accordingly, the exact structure of each 
Integration-interacting protein (INIP) will understandably vary so as to achieve this ligand 
and activity specificity. It is this specificity and the direct involvement of the Integration- 
interacting protein (IMP) in the chain of events leading to the level of integration by IN 
that offers the promise of a broad spectrum of diagnostic and therapeutic utilities for 

30 retroviral infection. 

The present invention naturally contemplates several means for preparation of the 
Integrase-interacting proteins (INIPs), including as illustrated herein known 
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preparations within its scope. The known cDNA and amino acid sequences of 
LEDGF/DFS70/p75 facilitates the reproduction of the Integrase-interacting proteins 
(INIPs) by such recombinant techniques, and accordingly, the invention extends to 
expression vectors prepared from the disclosed DNA sequences for expression in host 
systems by recombinant DNA techniques, and to the resulting transformed hosts. In 
present invention we also produced the recombinant INIP76-IN complex. 

Another feature of this invention is the expression of the DNA sequences encoding 
INIP76. As is well known in the art, DNA sequences may be expressed by operatively 
linking them to an expression control sequence in an appropriate expression vector and 
employing that expression vector to transform an appropriate unicellular host. 

Such operative linking of a DNA sequence of this invention to an expression control 
sequence, of course, includes, if not already part of the DNA sequence, the provision of 
an initiation codon, ATG, in the correct reading frame upstream of the DNA sequence. 

A wide variety of host/expression vector combinations may be employed in expressing 
the DNA sequences of this invention. Useful expression vectors, for example, may consist 
of segments of chromosomal, non-chromosomal and synthetic DNA sequences. Suitable 
vectors include derivatives of SV40 and known bacterial plasmids, e.g., E. coli plasmids 
col El, pCRl, pBR322, pMB9 and their derivatives, plasmids such as RP4; phage DNAS, 
e.g., the numerous derivatives of phage .lambda., e.g., NM989, and other phage DNA, 
e.g., M13 and filamentous single stranded phage DNA; yeast plasmids such as the 2.mu. 
plasmid or derivatives thereof; vectors useful in eukaryotic cells, such as vectors useful in 
insect or mammalian cells; vectors derived from combinations of plasmids and phage 
DNAs, such as plasmids that have been modified to employ phage DNA or other 
expression control sequences; and the like. 

Any of a wide variety of expression control sequences-sequences that control the 
expression of a DNA sequence operatively linked to it-may be used in these vectors to 
express the DNA sequences of this invention. Such useful expression control sequences 
include, for example, the early or late promoters of SV40, CMV, vaccinia, polyoma or 
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adenovirus, the la^^tem, the tip system, the TAC system, th^^C system, the LTR 
system, the major operator and promoter regions of phage .lambda., the control regions of 
fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, 
the promoters of acid phosphatase (e.g., Pho5), the promoters of the yeast .alpha. -mating 
factors, and other sequences known to control the expression of genes of prokaryotic or 
eukaryotic cells or their viruses, and various combinations thereof. 



A wide variety of unicellular host cells are also useful in expressing the DNA sequences 
of this invention. These hosts may include well known eukaryotic and prokaryotic hosts, 
10 such as strains of E, coli, Pseudomonas, Bacillus, Streptomyces, fungi such as yeasts, and 
animal cells, such as CHO, Rl.l, B-W and L-M cells, African Green Monkey kidney 
cells (e.g., COS 1, COS 7, BSC1, BSC40, and BMT10), insect cells (e.g., Sf9), and 
human cells and plant cells in tissue culture. 

15 It will be understood that not all vectors, expression control sequences and hosts will 
function equally well to express the DNA sequences of this invention. Neither will all 
hosts function equally well with the same expression system. However, one skilled in the 
art will be able to select the proper vectors, expression control sequences, and hosts 
without undue experimentation to accomplish the desired expression without departing 

20 from the scope of this invention. For example, in selecting a vector, the host must be 
considered because the vector must function in it. The vector's copy number, the ability to 
control that copy number, and the expression of any other proteins encoded by the vector, 
such as antibiotic markers, will also be considered. 

25 In selecting an expression control sequence, a variety of factors will normally be 
considered. These include, for example, the relative strength of the system, its 
controllability, and its compatibility with the particular DNA sequence or gene to be 
expressed, particularly as regards potential secondary structures. Suitable unicellular hosts 
will be selected by consideration of, e.g., their compatibility with the chosen vector, their 

30 secretion characteristics, their ability to fold proteins correctly, and their fermentation 
requirements, as well as the toxicity to the host of the product encoded by the DNA 
sequences to be expressed, and the ease of purification of the expression products. 
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Considering th< 




id other factors a person skilled in the 




be able to construct a 



variety of vector/expression control sequence/host combinations that will express the 
DNA sequences of this invention on fermentation or in large scale animal culture. 

5 It is further intended that the preparation of Integration-interacting protein (INIP) analogs 
or fragments may be prepared and derived from the respective nucleotide sequences of 
the protein complex/subunit, falls within the scope of the present invention. Analogs or 
fragments, may be produced, for example, by pepsin digestion of Integration-interacting 
protein (INIP) material. Other analogs, such as muteins, can be produced by standard site- 
10 directed mutagenesis of Integration-interacting protein (INIP) coding sequences. Analogs 
exhibiting "Integration-interacting protein (INIP) activity" such as small molecules, 
whether functioning as promoters or inhibitors, may be identified by known in vivo 
and/or in vitro assays. 

15 As mentioned above, a DNA sequence encoding Integration-interacting protein (INIP) 
can be prepared synthetically rather than cloned. The DNA sequence can be designed 
with the appropriate codons for the Integration-interacting protein (INIP) amino acid 
sequence. In general, one will select preferred codons for the intended host if the 
sequence will be used for expression. The complete sequence is assembled from 

20 overlapping oligonucleotides prepared by standard methods and assembled into a 
complete coding sequence. See, e.g., Edge, Nature, 292:756 (1981); Nambair et al., 
Science, 223:1299 (1984); Jay et al., J. Biol. Chem., 259:6311 (1984). 

Synthetic DNA sequences allow convenient construction of genes which will express 
25 Integration-interacting protein (IMP) analogs or "muteins". Alternatively, DNA encoding 
muteins can be made by site-directed mutagenesis of native Integration-interacting 
protein (INIP) genes or cDNAs, and muteins can be made directly using conventional 
polypeptide synthesis. 

30 A general method for site-specific incorporation of unnatural amino acids into proteins is 
described in Christopher J. Noren, Spencer J. Anthony-Cahill, Michael C. Griffith, Peter 
G. Schultz, Science, 244:182-188 (April 1989). This method may be used to create 
analogs with unnatural amino acids. 
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In general, techniques for nucleic acid or protein/peptide manipulation are well known. 
(See, e.g. Annual Rev. of Biochem. 1992; 61:131-156). Reagents useful in applying such 
nucleic acid manipulation techniques, such as restriction enzymes and the like, are widely 
5 known in the art and commercially available from a number of vendors. Large amounts of 
the nucleic acid sequences encoding the Inip76 or its antagonists or modulators may be 
obtained using well-established molecular biology procedures such as molecular cloning 
and replication of the vector or plasmid carrying the sequences in a suitable host cell. 
Either complete gene sequences or partial sequences encoding the desired Inip76 or its 

10 antagonists or modulators can be employed. The nucleic acid sequences encoding the 
Inip76 or its antagonists or modulators can also be produced in part or in total by 
chemical synthesis, e.g. by the phosphoramidite method described by Beaucage and 
Carruthers (Tetra Letts. 1981; 22:1859-1862) or the triester method (Matteucci et al, J. 
Am, Chem. Soc. 1981; 103:3185) and may be performed on commercial automated 

15 oligonucleotide synthesizers. A double-stranded fragment may be obtained from the 
single-stranded product of chemical synthesis either by synthesizing the complementary 
strand and annealing the strand together under appropriate conditions, or by synthesizing 
the complementary strand using DNA polymerase with an appropriate primer sequence. 

20 The invention includes an assay system for screening of potential drugs effective to 
modulate Integrase-interacting proteins (INIPs) activity of target mammalian cells by 
interrupting or potentiating the Integrase-interacting proteins (INIPs). In one instance, the 
test drug could be administered to a cellular sample with the ligand that activates the 
Integrase-interacting proteins (INIPs), or an extract containing the activated Integrase- 

25 interacting proteins (INIPs), to determine its effect upon the binding activity of the 
Integrase-interacting proteins (INIPs) to any chemical sample (including DNA or 
integrase), or to the test drug, by comparison with a control. 

The assay system could more importantly be adapted to identify drugs or other entities 
that are capable of binding to the Integrase-interacting proteins (INIPs), either 
30 extracellular, in the cytoplasm or in the nucleus, thereby inhibiting or potentiating the 
level of retrovirus integration action or strand-transfer activity of IN. Such an assay would 
be useful in the development of drugs that would be specific against the cellular 
receptivity for integration of exogenous (in the meaning of heterologous or homologous 
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but introduced^^iserted into the same natural cell type bu^^an unnatural state) or of 
introduced DNA in its chromosomal DNA or that would potentiate such activity, in time 
or in level of activity. For example, such drugs might be used to decrease or prevent the 
replication of the HTV-1 virus in its host organism, or such drug might be used to increase 
the integration of a transgene in cellular chromosomal DNA. 

In accordance with the above, an assay system for screening potential drugs effective to 
modulate the activity of the Integration-interacting protein (INIP) may be prepared. The 
Integration-interacting protein (IMP) may be introduced into a test system, and the 
prospective drug may also be introduced into the resulting cell culture, and the culture 
thereafter examined to observe any changes in the Integration-interacting protein (INIP) 
activity of the cells, due either to the addition of the prospective drug alone, or due to the 
effect of added quantities of the known Integration-interacting protein (INIP). 

In yet a further embodiment, the invention contemplates antagonists or modulators 
(increase or decrease) of the activity of a Integration-interacting protein (INIP) and then- 
use, more in particular as an antiviral therapeutic or for the manufacture of a medicament 
for the prevention or treatment of a viral infection, more in particular a lentiviral 
infection, yet more in particular a HTV-infection. The invention also relates to 
compositions containing such antagonists or modulators. The invention relates in 
particular to an agent or molecule that inhibits, influences or modulates binding of 
Ihtegrase-inhibiting protein, INIP76 to integrase. In a specific embodiment, the antagonist 
can be a peptide, more particularly a peptide having the sequence of a portion of the 
binding domain of the Integrase-inhibiting protein INHP76, which binds to the IN protein 
or forms an Inip76-IN complex. In other embodiments, the antagonists are selected from 
small molecules, oligonucleotides (antisense or aptamers), antigene therapeutics, small 
interfering RNAs (or RNA interference in general), soluble receptors, antibodies and/or 
cellular therapies. 

In a further embodiment, The present invention also relates to methods of treating or 
preventing viral infections by using antagonists or modulators of the binding of Inip76 
to IN. The present invention relates to certain therapeutic methods which would be based 
upon the activity of the Integration-interacting protein (INIP)(s), its (or their) subunits, or 
active fragments thereof, or upon agents or other drugs determined to possess the same 
activity. A first therapeutic method is associated with the prevention of the manifestations 



29 



' WO 2004/029246 PCT/BE2003/000164 

of conditions causally related to or following from the binding activity of the Integration- 
interacting protein (INIP) or its subunits, and comprises administering an agent capable of 
modulating the production and/or activity of the Integration-interacting protein (INIP) or 
subunits thereof, either individually or in mixture with each other in an amount effective 
5 to prevent the development of those conditions in the host. For example, drugs or other 
binding partners to the Integration-interacting protein (INIP) or proteins may be 
administered to inhibit Integration-interacting protein (IMP) activity, as in the inhibition 
of Integration-interacting protein (INIP) in anti-HTV-1 therapy. 

Also, the increase of the action of specific tyrosine phosphatases in the phosphorylation 
10 of activated (phosphorylated) Integration-interacting protein (INIP) or proteins presents a 
method for inhibiting the activity of the Integration-interacting protein (INIP) or protein 
that would concomitantly provide a therapy for retroviral infection based on Integration- 
interacting protein (INEP)/protein inactivation. 

More specifically, the therapeutic method generally referred to herein could include the 
15 method for the treatment of various retroviral infection pathologies or may integration of 
a DNA in cellular chromosomal DNA by the administration of pharmaceutical 
compositions that may comprise effective inhibitors of activation of the Integration- 
interacting protein (INIP) or its subunits, or other equally effective drugs developed for 
instance by a drug screening assay prepared and used in accordance with a further aspect 
20 of the present invention. 

Also, an increase of the action of specific tyrosine phosphatases in the dephosphorylation 
of activated Integrase-interacting proteins (INIPs) presents a method for inhibiting or 
decreasing the activity of the Integrase-interacting proteins (INIPs) that would 
concomitantly potentiate therapies based on Integration-interacting protein (INIP) 
25 deactivation. 



In the present invention small molecules, e.g. small organic molecules with a molecular 
mass <500 Da and other suitable molecules, can also function as antagonists or 
modulators of Inip76 in order to prevent or treat a viral infection. Small molecules and 
30 other drug candidates can readily be obtained, for example, from combinatorial and 
natural product libraries and using methods known to the art to screen candidate 
molecules for their Inip76 antagonizing or modulating function. Screening assays in this 
regard are known in the art, as described for example in PCT/US99/ 12001. Furthermore, 
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random peptid^^ries, consisting of all possible combinatid^^f amino acids, attached 
to a solid phase or in solution, may also be used to identify peptides that act as antagonists 
ofInip76. 



5 Suitable antagonists or modulators of the interaction of Inip76 with HIV IN can also be 
developed by known drug design methods, e.g. using structural analysis of the IN, Inip76 
or the complex by employing methods established in the art, for example, using X-ray 
crystallography to analyze the structure of the complex formed ( see for example Sielecki, 
A.R. et al. Science 1989; 243:1346-51; Dhanaraj, V. et al. Nature 1992; 357(6377):466- 
10 72) and/or by modifying known LEDGF binding molecules i.e. "lead compounds," to 
obtain (mere potent) inhibitors and compounds for different modes of administration (i.e. 
oral vs. intravenous). 



The present invention relates to antibodies directed against Inip76, In and disrupting or 
15 modulating the complex formation between Inip76 and IN and the use of said antibodies. 
The invention also relates to the development of antibodies against the Integrase- 
interacting proteins (TNIPs) and to compositions containing them, including naturally 
raised and recombinant^ prepared antibodies. Such antibodies could include both 
polyclonal and monoclonal antibodies prepared by known genetic techniques, as well as 
20 bi-specific (chimeric) antibodies, and antibodies including other functionality suiting 
them for additional diagnostic use conjunctive with their capability of modulating 
Integrase-inhibiting protein INBP76 (INIP76) activity. 

In particular, antibodies against specifically phosphorylated factors can be selected and 
are included within the scope of the present invention for their particular ability in 
25 following activated protein. Thus, activity of the Integration-interacting protein (BSUP) or 
of the specific polypeptides believed to be causally connected thereto may therefore be 
followed directly by the assay techniques discussed later on, through the use of an 
appropriately labelled quantity of the Integration-interacting protein (INIP) or antibodies 
or analogues thereof. 

30 

Also, antibodies including both polyclonal and monoclonal antibodies, and drugs that 
modulate the production or activity of the Integrase-interacting proteins (INIPs) and/or 
their subunits may possess certain diagnostic applications and may for example, be 
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or the like. 



;e of detecting and/or measuring conditio! 



;h as HTV-1 infection 



For example, the Integrase-interacting proteins (INlPs) or their subunits may be used to 
5 produce both polyclonal and monoclonal antibodies to themselves in a variety of cellular 
media, by known techniques such as the hybridoma technique utilizing, for example, 
fused mouse spleen lymphocytes and myeloma cells. Likewise, small molecules that 
mimic or antagonize the activity(ies) of the Integrase-interacting proteins (INIPs) of the 
invention may be discovered or synthesized, and may be used in diagnostic and/or 
10 therapeutic protocols. 

The general methodology for making monoclonal antibodies by hybridomas is well 
known. Immortal, antibody-producing cell lines can also be created by techniques other 
than fusion, such as direct transformation of B lymphocytes with oncogenic DNA, or 
15 transfection with Epstein-Barr virus. See, e.g., M. Schreier et al., "Hybridoma 
Techniques" (1980); Hammerling et al., "Monoclonal Antibodies And T-cell 
Hybridomas" (1981); Kennett et al, "Monoclonal Antibodies" (1980); see also, U.S. Pat. 
Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,451,570; 4,466,917; 4,472,500; 
4,491,632; 4,493,890. 

20 Methods for producing monoclonal anti-integration-interacting protein (INIP) antibodies 
are also well-known in the art. See Niman et al., Proc. Natl. Acad. Sci. USA, 80:4949- 
4953 (1983). Typically, the present Integration-interacting protein (INIP) or a peptide 
analog is used either alone or conjugated to an immunogenic carrier, as the immunogen in 
the before described procedure for producing anti-integration-interacting protein (INIP) 

25 monoclonal antibodies. The hybridomas are screened for the ability to produce an 
antibody that immunoreacts with the Integration-interacting protein (INIP) peptide analog 
and the present Integration-interacting protein (INIP). 

Panels of monoclonal antibodies produced against Integration-interacting protein (INIP) 
peptides can be screened for various properties; i.e., isotope, epitope, affinity, etc. Of 
30 particular interest are monoclonal antibodies that neutralize the activity of the Integration- 
interacting protein (INIP) or its subunits. Such monoclonals can be readily identified in 
Integration-interacting protein (INIP) activity assays. High affinity antibodies are also 
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useful when ii^fcao affinity purification of native or recom^^nt Integration-interacting 
protein (INIP) is possible. 

Methods for producing polyclonal anti-polypeptide antibodies are well-known in the art. 
See U.S. Pat. No. 4,493,795 to Nestor et al. A monoclonal antibody, typically containing 
Fab and/or F(ab').sub.2 portions of useful antibody molecules, can be prepared using the 
hybridoma technology described in Antibodies-A Laboratory Manual, Harlow and Lane, 
eds., Cold Spring Harbor Laboratory, New York (1988), which is incorporated herein by 
reference. Briefly, to form the hybridoma from which the monoclonal antibody 
composition is produced, a myeloma or other self-peipetuating cell line is fused with 
lymphocytes obtained from the spleen of a mammal hyperimmunized with a Integration- 
interacting protein (IMP)-binding portion thereof, or Integration-interacting protein 
(INIP), or an origin-specific DNA-binding portion thereof. 

Splenocytes are typically fused with myeloma cells using polyethylene glycol (PEG) 
6000. Fused hybrids are selected by their sensitivity to HAT. Hybridomas producing a 
monoclonal antibody useful in practicing this invention are identified by their ability to 
immunoreact with the present Integration-interacting protein (INIP) and their ability to 
inhibit specified INIP activity in target cells. 



A monoclonal antibody useful in practicing the present invention can be produced by 
initiating a monoclonal hybridoma culture comprising a nutrient medium containing a 
hybridoma that secretes antibody molecules of the appropriate antigen specificity. The 
culture is maintained under conditions and for a time period sufficient for the hybridoma 
to secrete the antibody molecules into the medium. The antibody-containing medium is 
then collected. The antibody molecules can then be further isolated by well-known 
techniques. 

Media useful for the preparation of these compositions are both well-known in the art and 
commercially available and include synthetic culture media, inbred mice and the like. An 
exemplary synthetic medium is Dulbecco's minimal essential medium (DMEM; Dulbecco 
et al, Virol. 8:396 (1959)) supplemented with 4.5 gm/1 glucose, 20 mm glutamine, and 
20% fetal calf serum. An exemplary inbred mouse strain is the Balb/c. 
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Inhibition of expre^^L of a Inip76 can be desirable to treat or^^ent viral infections. 
Where inhibition of expression of Inip76 is desirable, inhibitory nucleic acid sequences 
that interfere with expression of Inip76 at the transcriptional or translational level can also 
be used. The strategy called antisense, antigene or RNA-interference can be applied. 
5 These approaches utilise, for example, antisense nucleic acids, ribozymes, triplex agents 
or siRNAs to block transcription or translation of Inip76 mRNA or DNA or of a specific 
mRNA or DNA of Inip76, either by masking that mRNA with an antisense nucleic acid 
or DNA with a triplex agent, by cleaving the nucleotide sequence with a ribozyme or by 
destruction of the mRNA through a complex mechanism involved in RNA-interference. 
10 The present invention extends to antisense oligonucleotides, ribozymes, RNA interference 
and antigene therapeutics, their use and their preparation. 



Antisense nucleic acids are DNA or RNA molecules or nucleic acid analogs (e.g. hexitol 
nucleic acids, Peptide nucleic acids) that are complementary to at least a portion of a 

15 specific mRNA molecule (Weintraub Scientific American 1990; 262:40). In the cell, the 
antisense nucleic acids hybridise to the corresponding mRNA, forming a double-stranded 
molecule. The antisense nucleic acids interfere with the translation of the mRNA, since 
the cell will not translate a mRNA that is double-stranded. Antisense oligomers of about 
15 nucleotides are preferred, since they are easily synthesised and are less likely to cause 

20 problems than larger molecules when introduced into the target cell which produces 
Inip76. Also nucleic acids or analogs, complementary to the translation initiation site, e.g. 
between -10 or +10 regions of the Inip76 nucleotide sequence, are preferred. 
The potency of antisense oligonucleotides for inhibiting Inip76 may be enhanced using 
various methods including addition of polylysine, encapsulation into liposomes (antibody 

25 targeted, cationic acid, Sendai virus derived, etc.) or into nanoparticels in order to deliver 
the oligonucleotides into cells. Other techniques for enhancing the antisense capacity of 
oligonucleotides exist, such as the conjugation of the antisense oligonucleotides for 
example to "cell penetrating peptides" (Manoharan, M. Antisense Nucleic Acid Drug 
Dev. 2002; 12(2): 103-128 / Juliano, R.-L. Curr. Opin. Mol. Ther. 2000; 2(3): 297-303). 

30 

Use of for example an oligonucleotide or a PNA (Peptide nucleic acid) to stall 
transcription is known as the antigene strategy (e.g. triplex formation) In the case of 
oligonucleotides, the oligomer winds around double-helical DNA (major groove), 
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forming a thr^^anded helix. Therefore, these antigene cJ^lunds can be designed to 
recognise a unique site on a chosen gene and block transcription of that gene in vivo. 
(Maher et al. Antisense Res. and Dev. 1991; 1:227; Helene, C. Anticancer Drug Design 
1991; 6:569 / Casey, B.P. et al. Prog. Nucleic Acid Res. Mol. Biol. 2001; 67: 163-192 / 
Pooga, M. et al. Biomol. Eng. 2001; 17(6): 183-192 / Nielsen, P.E. Pharmacol. Toxicol. 
2000; 86(1): 3-7). Antigene oligonucleotides as wel as PNAs are easily synthesised by the 
man skilled in the art and are even commercially available. 

Ribozymes are molecules possessing the ability to specifically cleave other single 
stranded RNA in a manner analogous to DNA restriction endonucleases. Through the 
modification of nucleotide sequences which encode these RNAs, it is possible to engineer 
molecules that recognise specific nucleotide sequences in an RNA molecule and cleave it 
(Cech J. Amer. Med. Assn. 1988; 260:3030). A major advantage of this approach is that, 
because they are sequence-specific, only mRNAs with particular sequences are 
inactivated. 

There are two basic types of ribozymes namely, tetrahymena-type (Hasselhoff (1988) 
Nature 334:585) and "hammerhead"-type. Tetrahymena-type ribozymes recognise 
sequences which are four bases in length, while "hammerhead"-type ribozymes recognise 
base sequences 1 1-18 bases in length. The longer the recognition sequence, the greater the 
likelihood that the sequence will occur exclusively in the target mRNA species. 
Consequently, hammerhead-type ribozymes are preferable to tetrahymena-type ribozymes 
for inactivating a specific mRNA species and 18-based recognition sequences are 
preferable to shorter recognition sequences. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified 
by scanning the target molecule for ribozyme cleavage sites which include the following 
sequences, GUA, GUU and GUC. Once identified, short RNA sequences of between 15 
and 20 ribonucleotides corresponding to the region of the target gene containing the 
cleavage site may be evaluated for predicted structural features such assecondary 
structure that may render the oligonucleotide sequence unsuitable. The suitability of 
candidate targets may also be evaluated by testing their accessibility to hybridization with 
complementary oligonucleotides, using ribonuclease protection assays. 
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RNA-interference 




Lves the insertion of small pieces of doubli 




tded (ds) and even 



single stranded RNA into a cell. If the dsRNA corresponds with a gene in the cell, it will 
promote the destruction of mRNA produced by that gene, thereby preventing its 
expression. The technique has been shown to work on a variety of genes, even in human 
5 cells and in vivo. For example small interfering RNAs (siKNA), short-hairpin RNAs 
(shRNA) or vectors expressing such nucleic acids can be applied in the RNA-interference 
strategy in order to inhibit the translation of Inip76-mRNA. 

Therefore, the present invention relates to the use of nucleic acids mediating RNA 
interference specific for mRNA of a protein of the hepatoma-derived growth factor 
10 family, for the modulation of the nuclear transport of lentiviral integrase and to the use of 
nucleic acids mediating RNA interference specific for mRNA of a protein of the 
hepatoma-derived growth factor family, for the manufacture of a medicament for the 
prevention and/or treatment of a viral infection. This includes the use of siRNA, shRNA 
and vectors expressing nucleic acids, for RNA interference. 



Antisense RNA, DNA molecules and analogs, ribozymes, antigene compounds or nucleic 
acids for RNA interference of the invention may be prepared by any method known in the 
art for the synthesis of nucleic acid molecules. These include techniques for chemically 
synthesizing oligonucleotides well known in the art such as for example solid phase 

20 phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by 
in vitro and in vivo transcription of DNA sequences encoding the antisense RNA 
molecule. The DNA sequences described herein may thus be used to prepare such 
molecules. Such DNA sequences may be incorporated into a wide variety of vectors 
which incorporate suitable RNA polymerase promoters such as the T7 or SP6 polymerase 

25 promoters. Alternatively, antisense cDNA constructs that synthesize anti-sense RNA 
constitutively or inducibly, depending on the promoter used, can be introduced stably into 
cell lines 

Suppression function of Inip76 can also be achieved through administration of variant 
30 polypeptide (dominant negative variant form) of Inip76, or a nucleotide sequence 
encoding variant polypeptide of Inip76. By administering an Inip76 variant polypeptide 
or a nucleotide sequence encoding such polypeptide, the variant will compete with wild- 
type Inip76 for binding to its receptor. 
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Another aspect of the present invention is the use of gene transfer, including gene 
therapy, to deliver above mentioned molecules antagonising or modulating the binding of 
Inip76 to IN. Gene therapy means the treatment by the delivery of therapeutic nucleic 
5 acids to patient's cells. This is extensively reviewed in Lever and Goodfellow 1995; Br. 
Med Bull.,51, 1-242; Culverl995; Ledley, F.D. 1995. Hum. Gene Ther. 6, 1129. By gene 
transfer, a nucleic acid encoding a Inip76-antagonising agent is introduced into cells in a 
subject to express the Inip76-antagonist and inhibit the Inip76 function. To achieve gene 
therapy there must be a method of delivering genes to the patient's cells and additional 

10 methods to ensure the effective production of any therapeutic genes. There are two 
general approaches to achieve gene delivery; these are non-viral delivery and virus- 
mediated gene delivery. Vectors as described herein could be sued for this purpose. 
In one embodiment of the invention, nucleic acid encoding a antagonist or modulator of 
Inip76 is introduced in a subject in order to express the antagonist or modulator and 

1 5 prevent or treat a viral infection. For gene transfer, the key steps are 1) to select the mode 
of delivery, e.g. a proper vector for delivery of the inhibitor genes to the subject, 2) 
administer the nucleic acid to the subject; and 3) achieve appropriate expression of the 
transferred gene for satisfactory durations. Methods for gene transfer are known in the art. 
The method described below are merely for purposes of illustration and are typical of 

20 those that can be used to practice the invention. However, other procedures may also be 
employed, as is understood in the art. Most of the techniques to construct delivery 
vehicles such as vectors and the like are widely practiced in the art, and most practitioners 
are familiar with the standard resource materials which describe specific conditions, 
reagents and procedures which are described in the literature. 

25 

In a specific embodiment of the invention, it is clear that the preventive or treatment 
methods or that the antagonists or modulators of Inip76 as described herein, can also be 
used in combination with any therapy or compound known in the art for viral infections, 
more in particular HIV infections. The invention also relates to synergistic combinations, 
30 especially combinations with integrase inhibitors. Possible anti-viral agents for inclusion 
into the antiviral compositions or combined preparations of this invention include, for 
instance, interferon-alfa (either pegylated or not), nucleoside reverse transcriptase (RT) 
inhibitors (i.e. zidovudine, didanosine, stavudine, lamivudine, zalcitabine and abacavir), 
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non-nucleoside 




;e transcriptase inhibitors (i.e. nevirl 




delavirdine and 



efavirenz), protease inhibitors (i.e. saquinavir, indinavir, ritonavir, nelfinavir, amprenavir 
and lopinavir), integrase inhibitors such as the pyranodipyrimidines, L-708,906 or Si- 
1360, fusion inhibitor enfiivirtide, ribavirin, vidarabine, acyclovir, gancyclovir, 
5 amantadine, rimantadine and other selective inhibitors of the replication of HIV. The 
composition of the invention can also contain drugs having a general beneficial activity 
for virally infected mammals, such as antibiotics or corticosteroids. 

The diagnostic utility of the present invention extends to the use of the present Integrase- 
10 interacting proteins (INIPs) in assays to screen for Integrase-interacting proteins (INIPs), 
to assay the infectability of a subject by a retroviral or measuring conditions such as HIV- 
1 infection or the like. 

In a particular embodiment, antibodies and other antagonists or modulators of Inip76 can 
be used for this purpose. The Integrase-interacting proteins (INIPs), their analogs and/or 
15 analogs, and any antagonists or antibodies that may be raised thereto, are capable of use 
in connection with various diagnostic techniques, including immunoassays, such as a 
radioimmunoassay, using for example, an antibody to the Integration-interacting protein 
(INIP) that has been labeled by either radioactive addition, or radioiodination. 

20 In an immunoassay, a control quantity of the antagonists or antibodies thereto, or the like 
may be prepared and labeled with an enzyme, a specific binding partner and/or a 
radioactive element, and may then be introduced into a cellular sample. 
After the labeled material or its binding partner(s) has had an opportunity to react with 
sites within the sample, the resulting mass may be examined by known techniques, which 

25 may vary with the nature of the label attached. For example, antibodies against 
specifically phosphorylated factors may be selected and appropriately employed in the 
exemplary assay protocol, for the purpose of following activated protein as described 
above. 

In the instance where a radioactive label, such as the isotopes .sup. 3 H, .sup. 14 C, .sup. 32 P, 
30 .sup. 35 S, .sup. 36 CI, .sup. 51 Cr, .sup. 57 Co, .sup. 58 Co, .sup. 59 Fe, .sup. 90 Y, .sup. 125 I, 
.sup. 131 1, and .sup. 186 Re are used, known currently available counting procedures maybe 
utilized. In the instance where the label is an enzyme, detection may be accomplished by 
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any of the preslj^ utilized colorimetric, spectxophotometric^J^rospectrophotometric, 

amperometric or gasometric techniques known in the art. 

The present invention includes an assay system which may be prepared in the form of a 
5 test kit for the quantitative analysis of the receptivity of a test subject for retroviral 
infection or to measuring conditions such as HIV-1 infection or the like by the extent of 
the presence of the Integration-interacting protein (INIP), or to identify drugs or other 
agents that may mimic or block their activity. The system or test kit may comprise a 
labeled component prepared by one of the radioactive and/or enzymatic techniques 
10 discussed herein, coupling a label to the Integration-interacting protein (INIP), their 
agonists and/or antagonists, and one or more additional immunochemical reagents, at 
least one of which is a free or immobilized ligand, capable either of binding with the 
labeled component, its binding partner, one of the components to be determined or their 
binding partner(s). 

15 

Particularly, the anti-Integration-interacting protein (INIP) antibody used in the diagnostic 
methods of this invention is an affinity purified polyclonal antibody. More preferably, the 
antibody is a monoclonal antibody (mAb). In addition, it is preferable for the anti- 
Integration-interacting protein (INIP) antibody molecules used herein be in the form of 
20 Fab, Fab\ F(ab f ).sub.2 or F(v) portions of whole antibody molecules. 



As suggested earlier, the diagnostic method of the present invention comprises examining 
a cellular sample or medium by means of an assay including an effective amount of an 
antagonist to a Integration-interacting protein (INIP)/protein, such as an anti-Integration- 

25 interacting protein (INIP) antibody, preferably an affinity-purified polyclonal antibody, 
and more preferably a mAb. In addition, it is preferable for the anti-Integration- 
interacting protein (INIP) antibody molecules used herein be in the form of Fab, Fab', 
F(ab').sub.2 or F(v) portions or whole antibody molecules. As previously discussed, 
patients capable of benefiting from this method include those suffering from an HIV-1 

30 infection or other like pathological derangement. Methods for isolating the Integration- 
interacting protein (INIP) and inducing anti-Integration-interacting protein (INIP) 
antibodies and for determining and optimizing the ability of anti-Integration-interacting 
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protein (IMP) anti^^es to assist in the examination of the targe^^s are all well-known 
in the art. 



The possibilities both diagnostic and therapeutic that are raised by the existence of the 
5 Integration-interacting protein (INIPs), derive from the fact that the factors appear to 
participate in direct and causal protein-protein interaction between the Integration- 
interacting protein (INIPs), and those factors that thereafter interact with IN and thereby 
increase inetgration. As suggested earlier and elaborated further on herein, the present 
invention contemplates pharmaceutical intervention in the cascade of reactions in which 
10 the Integration-interacting protein (INIPs) are implicated, to modulate the activity 
initiated by the Integration-interacting protein (INIPs). 

Thus, in instances where it is desired to reduce or inhibit the (resulting from a particular 
stimulus or factor, an appropriate inhibitor of the Integration-interacting protein (INIP) 
could be introduced to block the interaction of the Integration-interacting protein (INIP) 
1 5 with IN thereby affecting its strand-transfer activity. 

The present invention also relates to a variety of diagnostic applications, including 
methods for detecting the presence of stimuli such as the earlier discussed INIP, by 
reference to their ability to elicit the specific activities which are mediated by the 
20 Integration-interacting protein (INIP). 

As mentioned earlier, the Integration-interacting protein (INIP) or the protein DSUP 
interacts with can be used to produce antibodies to by a variety of known techniques (e.g. 
hybridoma techniques), and such antibodies could then be isolated and utilized as in tests 
25 for determining the presence of particular Integration-interacting protein (INIP) activity in 
suspect target cells. For convenience, the antibody(ies) to the Integration-interacting 
protein (INIP) will be referred to herein as Ab.sub.l and antibody(ies) raised in another 
species as Ab.sub.2. 

30 The presence of Integration-interacting protein (INIP) in cells can be ascertained by the 
usual immunological procedures applicable to such determinations. A number of useful 
procedures are known. Three such procedures which are especially usefull utilize either 
the Integration-interacting protein (INIP) labeled with a detectable label, antibody 

40 



WO 2004/029246 



PCT/BE2003/000164 



Ab.sub.2. labi 




ith a detectable label, or antibody Ab.sul 



tbeled with a detectable 



label. The procedures may be summarized by the following equations wherein the asterisk 
indicates that the particle is labeled, and IMP stands for the Integration-interacting 
protein: 

5 A. INIP*+Ab.sub.l IMP* Ab.sub.l 

B. IMP+Ab*=TIP Ab.sub.l * 

C. INP+Ab.sub.l +Ab.sub.2 *=TIP Ab.sub.l Ab.sub.2 * 

The procedures and their application are all familiar to those skilled in the art and 
10 accordingly may be utilized within the scope, of the present invention. The "competitive" 
procedure, Procedure A, is described in U.S. Pat. Nos. 3,654,090 and 3,850,752. 
Procedure C, the "sandwich" procedure, is described in U.S. Pat. Nos. RE 31,006 and 
4,016,043. Still other procedures are known such as the "double antibody," or "DASP" 
procedure. 

15 In each instance, the Integration-interacting protein (IMP) forms complexes with one or 
more antibody(ies) or binding partners and one member of the complex is labeled with a 
detectable label. The fact that a complex has formed and, if desired, the amount thereof, 
can be determined by known methods applicable to the detection of labels. 
It will be apparent from the above, that a characteristic property of Ab.sub.2 is that it will 

20 react with Ab.sub.l. This is because Ab.sub.l raised in one mammalian species has been 
used in another species as an antigen to raise the antibody Ab.sub.2. For example, 
Ab.sub.2 may be raised in goats using rabbit antibodies as antigens. Ab.sub.2 therefore 
would be anti-rabbit antibody raised in goats. For purposes of this description and claims, 
Ab.sub.l will be referred to as a primary or anti-integration-interacting protein (IMP) 

25 antibody, and Ab.sub.2 will be referred to as a secondary or anti- Ab.sub.l antibody. 

The labels most commonly employed for these studies are radioactive elements, enzymes, 
chemicals which fluoresce when exposed to ultraviolet light, and others. 
Radioactive labels can be detected by any of the currently available counting procedures. 
30 The preferred isotope may be elected from 3H, .sup. 14 C, .sup. 32 P, .sup. 35 S, .sup. 36 CI, 
.sup. 51 Cr, .sup. 57 Co, .sup. 58 Co, .sup. 59 Fe, .sup. 90 Y, .sup. 125 1, .sup. 131 1, and .sup. 186 Re. 
Enzyme labels are likewise useMl, and can be detected by any of the presently utilized 
calorimetric, spectrophotometric, fluorospectrophotometric, amperometric or gasometric 
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techniques. The enzyme is conjugated to the selected particle b^^iction with bridging 
molecules such as carbodiimides, diisocyanates, glutaraldehyde and the like. Many 
enzymes which can be used in these procedures are known and can be utilized. The 
preferred are peroxidase, .beta.-glucuronidase, .beta.-D-glucosidase, .beta.-D- 
5 galactosidase, urease, glucose oxidase plus peroxidase and alkaline phosphatase. U.S. Pat. 
Nos. 3,654,090; 3,850,752; and 4,016,043 are referred to by way of example for their 
disclosure of alternate labeling material and methods. 

A number of fluorescent materials are known and can be utilized as labels. These include, 
for example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue and Lucifer 
10 Yellow. A particular detecting material is anti-rabbit antibody prepared in goats and 
conjugated with fluorescein through an isothiocyanate. 



A particular assay system developed and utilized in accordance with the present 
invention, is known as a receptor assay. In a receptor assay, the material to be assayed is 
15 appropriately labeled and then certain cellular test colonies are inoculated with a quantity 
of both the labeled and unlabeled material after which binding studies are conducted to 
determine the extent to which the labeled material binds to the cell receptors. In this way, 
differences in affinity between materials can be ascertained. 



20 Accordingly, a purified quantity of the Integration-interacting protein (INIP) may be 
radiolabeled and combined, for example, with antibodies or other inhibitors thereto, after 
which binding studies would be carried out. Solutions would then be prepared that 
contain various quantities of labeled and unlabeled uncombined Integration-interacting 
protein (IMP), and cell samples would then be inoculated and thereafter incubated. The 

25 resulting cell monolayers are then washed, solubilized and then counted in a gamma 
counter for a length of time sufficient to yield a standard error of <5%. These data are 
then subjected to Scatchard analysis after which observations and conclusions regarding 
material activity can be drawn. While the foregoing is exemplary, it illustrates the manner 
in which a receptor assay may be performed and utilized, in the instance where the 

30 cellular binding ability of the assayed material may serve as a distinguishing 
characteristic. 
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An assay use] 




contemplated i: 



in accordance with the 




invention is known as a 



"cis/trans" assay. Briefly, this assay employs two genetic constructs, one of which is 
typically a plasmid that continually expresses a particular receptor of interest when 
transfected into an appropriate cell line, and the second of which is a plasmid that 
expresses a reporter such as luciferase, under the control of a receptor/ligand complex. 
Thus, for example, if it is desired to evaluate a compound as a ligand for a particular 
receptor, one of the plasmids would be a construct that results in expression of the 
receptor in the chosen cell line, while the second plasmid would possess a promoter 
linked to the luciferase gene in which the response element to the particular receptor is 
inserted. If the compound under test is an agonist for the receptor, the ligand will complex 
with the receptor, and the resulting complex will bind the response element and initiate 
transcription of the luciferase gene. The resulting chemiluminescence is then measured 
photometrically, and dose response curves are obtained and compared to those of known 
ligands. The foregoing protocol is described in detail in U.S. Pat. No. 4,981,784 and PCT 
International Publication No. WO 88/03168, for which purpose the artisan is referred. 

In a further embodiment of this invention, commercial test kits suitable for use by a 
medical specialist may be prepared to determine the presence or absence of 
predetermined Integration-interacting protein (INIP) activity or predetermined 
Integration-interacting protein (INIP) activity capability in suspected target cells. In 
accordance with the testing techniques discussed above, one class of such kits will 
contain at least the labeled Integration-interacting protein (INIP) or its binding partner, for 
instance an antibody specific thereto, and directions, of course, depending upon the 
method selected, e.g., " competitive, " "sandwich," "DASP" and the like. The kits may also 
contain peripheral reagents such as buffers, stabilizers, etc. 

Accordingly, a test kit may be prepared for the demonstration of the presence or 
capability of cells for predetermined Integration-interacting protein (INIP) activity, 
comprising: 

a) a predetermined amount of at least one labeled immunochemical^ reactive component 
obtained by the direct or indirect attachment of the present Integration-interacting protein 
(MOP) factor or a specific binding partner thereto, to a detectable label; 

b) other reagents; and 
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More specifically, the diagnostic test kit may comprise: 

a) a known amount of the Integration-interacting protein (INIP) as described above (or a 
5 binding partner) generally bound to a solid phase to form an immunosorbent, or in the 

alternative, bound to a suitable tag, or plural such end products, etc. (or their binding 
partners) one of each; 

b) if necessary, other reagents; and 

c) directions for use of said test kit. 

10 

In a further variation, the test kit may be prepared and used for the purposes stated above, 
which operates according to a predetermined protocol (e.g. "competitive," "sandwich," 
"double antibody," etc.), and comprises: 

a) a labeled component which has been obtained by coupling the Integration-interacting 
1 5 protein (INIP) to a detectable label; 

b) one or more additional immunochemical reagents of which at least one reagent is a 
ligand or an immobilized ligand, which ligand is selected from the group consisting of: 

(i) a ligand capable of binding with the labeled component (a); 

(ii) a ligand capable of binding with a binding partner of the labeled component (a); 
20 (iii) a ligand capable of binding with at least one of the component(s) to be 

determined; and 

(iv) a ligand capable of binding with at least one of the binding partners of at least 
one of the component(s) to be determined; and 

c) directions for the performance of a protocol for the detection and/or determination of 
25 one or more components of an immunochemical reaction between the Integration- 
interacting protein (INIP) and a specific binding partner thereto. 

In a specific ambodiment, the present invention also relates to a method of increasing the 
integration of a certain polycucleotides. For this reason Inip proteins, more in particular 
30 proteins of the family of the hepatoma-derived growth factors, yet more in particular 
Inip76, analogs, derivatives or fragments thereof, polynucleotides or vectors expressing 
them can be used. This can be used in combination with any method of gene therapy. 
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In a particul; 




Lbodiment, the Integrase-interacting 




?ins with a sequence 



substantially homologous to LEDGF, their antibodies, agonists, antagonists, modulators 
or active fragments thereof, could be prepared in pharmaceutical formulations or 
compositions for administration in instances wherein anti-HIV therapy is appropriate. 
Such compounds may thus be prepared in pharmaceutical compositions, with a suitable 
carrier and at a strength effective for administration by various means to a patient 
experiencing an adverse medical condition associated with HIV-1 infection for the 
treatment thereof. A variety of administrative techniques may be utilized, among them 
parenteral techniques such as subcutaneous, intravenous and intraperitoneal injections, 
catheterizations and the like. Average quantities of the antagonist of the Integration- 
interacting protein (INIP) or their subunits may vary and in particular should be based 
upon the recommendations and prescription of a qualified physician or veterinarian. The 
formulations include those suitable for oral, rectal, nasal, topical (including buccal and 
sublingual), vaginal or parenteral (including subcutaneous, intramuscular, intravenous, 
intradermal, intrathecal and epidural) administration. The formulations may conveniently 
be presented in unit dosage form and may be prepared by any of the methods well known 
in the art of pharmacy. Such methods include the step of bringing into association the 
active ingredient with the carrier which constitutes one or more accessory ingredients. In 
general the formulations are prepared by uniformly and intimately bringing into 
association the active ingredient with liquid carriers or finely divided solid carriers or 
both, and then, if necessary, shaping the product. 

The present invention further contemplates therapeutic compositions useful in practicing 
the therapeutic methods of this invention. A subject therapeutic composition includes, in 
admixture, a pharmaceutically acceptable excipient (carrier) and one or more of a 
Integration-interacting protein (INIP), polypeptide analog thereof or fragment thereof, as 
described herein as an active ingredient. In a preferred embodiment, the composition 
comprises an antigen capable of modulating the specific binding of the present 
Integration-interacting protein (INIP ) within a target cell. 

The present invention also relates to pharmaceutical compositions including one or more 
compounds selected from the molecules (antagonists or modulators) which comprise a 
region specifically binding to a protein of the hepatoma-derived growth factor family or 
nucleic acids encoding said protein of the hepatoma-derived growth factor family, for the 
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manufacture of a 




icament for the prevention and/or treal 




of a viral infection, 



more particularly selected from antibodies, antisens oligonucleotides, ribozymes, antigene 
therapeutics, small molecules, variant polypeptides, small interfering RNAs or vectors 
used for RNA interference. 



The preparation of therapeutic compositions which contain polypeptides, analogs or 
active fragments, other antagonist or modulators as active ingredients is well understood 
in the art. Such compositions can be prepared as injectables, either as liquid solutions or 
suspensions, however, solid forms suitable for solution in, or suspension in, liquid prior to 

10 injection can also be prepared. The preparation can also be emulsified. The active 
therapeutic ingredient is often mixed with excipients which are pharmaceutically 
acceptable and compatible with the active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. 
In addition, if desired, the composition can contain minor amounts of auxiliary substances 

15 such as wetting or emulsifying agents, pH buffering agents which enhance the 
effectiveness of the active ingredient. 

A polypeptide, analog or active fragment can be formulated into the therapeutic 
composition as neutralized pharmaceutically acceptable salt forms. Pharmaceutically 

20 acceptable salts include the acid addition salts (formed with the free amino groups of the 
polypeptide or antibody molecule) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed from the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 

25 calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 
2-ethylamino ethanol, histidine, procaine, and the like. 

The therapeutic polypeptide-, analog- or active fragment-containing compositions are 
conventionally administered intravenously, as by injection of a unit dose, for example. 
30 The term "unit dose" when used in reference to a therapeutic composition of the present 
invention refers to physically discrete units suitable as unitary dosage for humans, each 
unit containing a predetermined quantity of active material calculated to produce the 
desired therapeutic effect in association with the required diluent; i.e., carrier, or vehicle. 
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The compositions are administered in a manner compatible with the dosage formulation, 
and in a therapeutically effective amount. The quantity to be administered depends on the 
subject to be treated, capacity of the subject's immune system to utilize the active 
ingredient, and degree of inhibition or neutralization of Integration-interacting protein 
(INIP) binding capacity desired. Precise amounts of active ingredient required to be 
administered depend on the judgment of the practitioner and are peculiar to each 
individual. However, suitable dosages may range from about 0.1 to 20, preferably about 
0.5 to about 10, and more preferably one to several, milligrams of active ingredient per 
kilogram body weight of individual per day and depend on the route of administration. 
Suitable regimes for initial administration and booster shots are also variable, but are 
typified by an initial administration followed by repeated doses at one or more hour 
intervals by a subsequent injection or other administration. Alternatively, continuous 
intravenous infusion sufficient to maintain concentrations of ten nanomolar to ten 
micromolar in the blood are contemplated. 

Suitable pharmaceutical carriers for use in said pharmaceutical compositions and their 
formulation are well known to those skilled in the art, and there is no particular restriction 
to their selection within the present invention. Suitable carriers or excipients known to the 
skilled man are saline, Ringer's solution, dextrose solution, Hank's solution, fixed oils, 
ethyl oleate, 5% dextrose in saline, substances that enhance isotonicity (such as sugars or 
sodium chloride) and chemical stability, buffers and preservatives. Other suitable carriers 
include any carrier that does not itself induce the production of antibodies harmful to the 
individual receiving the composition such as proteins, polysaccharides, polylactic acids, 
polyglycolic acids, polymeric amino acids and amino acid copolymers. They may also 
include additives such as wetting agents, dispersing agents, stickers, adhesives, 
emulsifying agents, solvents, coatings, antibacterial and antifungal agents (for example 
phenol, sorbic acid, chlorobutanol) and the like, provided the same are consistent with 
pharmaceutical practice, i.e. carriers and additives which do not create permanent damage 
to mammals. The pharmaceutical compositions of the present invention may be prepared 
in any known manner, for instance by homogeneously mixing, coating and/or grinding 
the active ingredients, in a one-step or multi-steps procedure, with the selected carrier 
material and, where appropriate, the other additives such as surface-active agents may 
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microspheres usually having a diameter of about 1 to 10 nm, namely for the manufacture 
of microcapsules for controlled or sustained release of the active ingredients. 

Suitable surface-active agents to be used in the phannaceutical compositions of the 
present invention are non-ionic, cationic and/or anionic materials having good 
emulsifying, dispersing and/or wetting properties. Suitable anionic surfactants include 
both water-soluble soaps and water-soluble synthetic surface-active agents. Suitable soaps 
are alkaline or alkaline-earth metal salts, unsubstituted or substituted ammonium salts of 
higher fatty acids (CIO-C22), e.g. the sodium or potassium salts of oleic or stearic acid, or 
of natural fatty acid mixtures obtainable form coconut oil or tallow oil. Synthetic 
surfactants include sodium or calcium salts of polyacrylic acids; fatty sulphonates and 
sulphates; sulphonated benzimidazole derivatives and alkylarylsulphonates. Fatty 
sulphonates or sulphates are usually in the form of alkaline or alkaline-earth metal salts, 
unsubstituted ammonium salts or ammonium salts substituted with an alkyl or acyl radical 
having from 8 to 22 carbon atoms, e.g. the sodium or calcium salt of lignosulphonic acid 
or dodecylsulphonic acid or a mixture of fatty alcohol sulphates obtained from natural 
fatty acids, alkaline or alkaline-earth metal salts of sulphuric or sulphonic acid esters 
(such as sodium lauryl sulphate) and sulphonic acids of fatty alcohol/ethylene oxide 
adducts. Suitable sulphonated benzimidazole derivatives preferably contain 8 to 22 
carbon atoms. Examples of alkylarylsulphonates are the sodium, calcium or alcanolamine 
salts of dodecylbenzene sulphonic acid or dibutyl-naphtalenesulphonic acid or a 
naphtalene-sulphonic acid/formaldehyde condensation product. Also suitable are the 
corresponding phosphates, e.g. salts of phosphoric acid ester and an adduct of p- 
nonylphenol with ethylene and/or propylene oxide, or phospholipids. Suitable 
phospholipids for this purpose are the natural (originating from animal or plant cells) or 
synthetic phospholipids of the cephalin or lecithin type such as e.g. 
phosphatidylethanolamine, phosphatidylserine, phosphatidylglycerine, lysolecithin, 
cardiolipin, dioctanylphosphatidyl-choline, dipalmitoylphoshatidyl -choline and their 
mixtures. 

Suitable non-ionic surfactants include polyethoxylated and polypropoxylated derivatives 
of alkylphenols, fatty alcohols, fatty acids, aliphatic amines or amides containing at least 
12 carbon atoms in the molecule, alkylarenesulphonates and dialkylsulphosuccinates, 
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such as polygly^^her derivatives of aliphatic and cycloalipl^^alcohols, saturated and 
unsaturated fatty acids and alkylphenols, said derivatives preferably containing 3 to 10 
glycol ether groups and 8 to 20 carbon atoms in the (aliphatic) hydrocarbon moiety and 6 
to 18 carbon atoms in the allcyl moiety of the alkylphenol. Further suitable non-ionic 
5 surfactants are water-soluble adducts of polyethylene oxide with poylypropylene glycol, 
ethylenediaminopolypropylene glycol containing 1 to 10 carbon atoms in the alkyl chain, 
which adducts contain 20 to 250 ethyleneglycol ether groups and/or 10 to 100 
propyleneglycol ether groups. Such compounds usually contain from I to 5 ethyleneglycol 
units per propyleneglycol unit. Representative examples of non-ionic surfactants are 

10 nonylphenol -polyethoxyethanol, castor oil polyglycolic ethers, polypropylene/ 
polyethylene oxide adducts, tributylphenoxypolyethoxyethanol, polyethyleneglycol and 
octylphenoxypolyethoxyethanol. Fatty acid esters of polyethylene sorbitan (such as 
polyoxyethylene sorbitan trioleate), glycerol, sorbitan, sucrose and pentaerythritol are 
also suitable non-ionic surfactants. 

15 Suitable cationic surfactants include quaternary ammonium salts, preferably halides, 
having 4 hydrocarbon radicals optionally substituted with halo, phenyl, substituted phenyl 
or hydroxy; for instance quaternary ammonium salts containing as N-substituent at least 
one C8C22 alkyl radical (e.g. cetyl, lauryl, palmityl, myristyl, oleyl and the like) and, as 
further substituents, unsubstituted or halogenated lower alkyl, benzyl and/or hydroxy- 

20 lower alkyl radicals. 

A more detailed description of surface-active agents suitable for this purpose may be 
found for instance in "McCutcheon's Detergents and Emulsifiers Annual" (MC Publishing 
Crop., Ridgewood, New Jersey, 1981), "Tensid-Taschenbucw', 2 d ed. (Hanser Verlag, 
Vienna, 1981) and "Encyclopaedia of Surfactants, (Chemical Publishing Co., New York, 

25 1981). 

Additional ingredients may be included in order to control the duration of action of the 
active ingredient in the composition. Control release compositions may thus be achieved 
by selecting appropriate polymer carriers such as for example polyesters, polyamino 
acids, polyvinyl pyrrolidone, ethylene-vinyl acetate copolymers, methylcellulose, 
30 carboxymethylcellulose, protamine sulfate and the like. The rate of drug release and 
duration of action may also be controlled by incorporating the active ingredient into 
particles, e.g. microcapsules, of a polymeric substance such as hydrogels, polylactic acid, 
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hyfroxymethylcellurose, polyniethyl methacrylate and the oiner above-described 
polymers. 

Such methods include colloid drug delivery systems like liposomes, microspheres, 
microemulsions, nanoparticles, nanocapsules and so on. Depending on the route of 
5 administration, the pharmaceutical composition may require protective coatings. 
Pharmaceutical forms suitable for injectionable use include sterile aqueous solutions or 
dispersions and sterile powders for the extemporaneous preparation thereof. Typical 
carriers for this purpose therefore include biocompatible aqueous buffers, ethanol, 
glycerol, propylene glycol, polyethylene glycol and the like and mixtures thereof. 



The therapeutic compositions may include an effective amount of the Integration- 
interacting protein (INIP) antagonist, and other active ingredients. Exemplary 
formulations are given below: 



10 



15 



Formulations 



Ingredient 



mg/ml 



Intravenous Fonnulatiofi I 



20 



cefotaxime 250.0 



Integration-interacting protein (IMP) antagonist 10.0 
dextrose USP 45.0 



sodium bisulfite USP 3.2 



edetate disodium USP 0.1 



25 



water for injection q.s.a.d. 1.0 ml 



Intravenous Formulation II 
ampicillin 250.0 

Integration-interacting protein (INIP) antagonist 10.0 
sodium bisulfite USP 3.2 



30 



disodium edetate USP 0.1 



water for injection q.s.a.d. 1.0 ml 



Intravenous Formulation III 



gentamicin (charged as sulfate) 40.0 
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Integra^fenteracting protein (INIP) antagonist 10,0 

sodium bisulfite USP 3.2 

disodium edetate USP 0.1 

water for injection q.s.a.d. 1.0 ml 
5 Intravenous Formulation W 

Integration-interacting protein (INIP) antagonist 10.0 

dextrose USP 45.0 

sodium bisulfite USP 3.2 

edetate disodium USP 0.1 
10 water for injection q.s.a.d. 1.0 ml 

Intravenous Formulation V 

Integration-interacting protein (INIP) antagonist 5.0 

sodium bisulfite USP 3.2 

disodium edetate USP 0.1 
15 water for injection q.s.a.d. 1 .0 ml 



The minimal nuclear HIV-1 IN complex is a homotetramer 

20 A serious obstacle in working with recombinant retroviral integrases is their poor 
solubility and propensity for aggregation. When produced in bacteria, HIV IN has to be 
extracted from inclusion bodies; detergents are used to stabilize the enzyme in solution. 
All crystal structure and some in vitro multimerization studies have been done with the 
soluble mutants. Furthermore, recent reports raised concern that the stoichiometry and 

25 enzymatic activities of recombinant IN can be affected by the enzyme preparation (Leh et 
al., 2000; Sinha et al., 2002). The goal of the research was to study protein complexes that 
HIV-1 IN forms within nuclei of human cells. The bulk of HIV-1 IN present in 293T cells 
that stably produce this viral protein, is associated with the insoluble nuclear fraction. 
Although IN seems to be directly or indirectly bound to chromosomal DNA, this may not 

30 be the only factor in nuclear retention of IN, since digestion of the detergent- 
permeabilized nuclei with nucleases was not sufficient to elute IN. This study was 
directed on IN complexes extracted from the detergent-permeabilized nuclei in hypertonic 
conditions. The present invention relates to the existence of the salt-eluted IN as part of a 
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distinct 61 A co] 




which is not stable in diluted nuclear 




its and dissociates, 



releasing a 41 A core molecule. According to the cross-linking and gel filtration data, the 
present invention also relates to the 41 A molecule being a tetramer of EN. Indeed, the 
complex exists in immunoprecipitated INf preparations. Its molecular weight determined 
5 both by gel filtration and cross-linking (115-150 kDa) is close to the value expected for 
the presumed tetramer (128 kDa). Finally, partial cross-linking of the diluted nuclear 
extracts and of the purified tagged IN protein produced bands at the positions 
corresponding to the IN monomer, dimer and trimer, in addition to pl50 cl (data not shown 
and Figure 2A, lane 7). It was estimated that the concentrations of IN and INf in the 
10 nuclear extracts did not exceed lOnM (<0.3^g/ml IN at lOO^ig/ml total protein). 
Therefore, the IN tetramer was stable even at subnanomolar concentrations (i.e. in the 
extracts diluted to 4 |ig/ml of total protein), showing that the minimal nuclear IN complex 
is a tetramer. 

1 5 Composition of the 61 A HIV-1 IN complex 

The native 61 A IN complex was found to decay during prolonged incubations and not to 
be stable during purification. When the immunoprecipitation time was shortened and 
higher initial extract concentrations were used, a cellular protein with an apparent MW of 
76 kDa specifically co-precipitated with the epitope-tagged IN. The protein, or Inip76, as 

20 we refer to it, was found to be identical to human LEDGF/DFS70/p75 protein. It was 
further confirmed that Inip76/LEDGF is a part of the native 61 A IN complex, since 
addition of a monoclonal anti-LEDGF antibody shifted elution of the complex from a gel 
filtration column. Importantly, when LEDGF protein was present, the p300 cl band became 
visible after DTSSP cross-linking of the immunoprecipitated INf. This band could not be 

25 reproduced in LEDGF-negative INf samples obtained following the long 
immunoprecipitation protocol. P300 cl seems to be the result of partial cross-linking of the 
61 A IN complex for the following three reasons: (i) the MW was lower than the expected 
value for a 61 A globular complex («400kDa); (ii) DTSSP cross-linking products with 
apparent MWs higher than that of p300 cl were visible on anti-IN immunoblots; (Hi) this 

30 product did not react with anti-LEDGF antibody. Based on its apparent molecular weight, 
we speculate that p300 cl probably represents two cross-linked IN tetramers. 
The exact stoichiometry of the IN-Inip76 complex was investigated. At the concentrations 
it is present in nuclear extracts, the 61 A complex was not stable enough to allow 



WO 2004/029246 PO7BE2003/000164 

measurement o^^sedimentation coefficient, which is requi^^o determine its precise 
MW (Siegel and Monty, 1965). Assuming that the 61 A complex is globular, it can be 
estimated that its MW is around 400 kDa. The simplest model compatible with this MW 
is of a symmetrical complex containing a pair of IN tetramers and two subunits of Inip76, 
5 which corresponds to a macromolecule of 370 kDa. In a particular embodiment of the 
invention, the native 61 A complex contains another cellular protein. 

Retroviral IN within PIC: a dimer of dimers or a dimer of tetramers? 

During reverse transcription, the two retroviral cDNA termini are not completed 

10 simultaneously and both seem to be substrates for the 3'-processing activity of IN as soon 
as they appear (Miller et al., 1997). Moreover, at least in case of HIV, 3 '-processing of 
one LTR end was observed when the second end was non-functional and also did not 
support normal intasome assembly (Chen and Engelman, 2001). On the other hand, two 
functional LTRs were found to be required for strand transfer activity of isolated HIV 

15 PICs. Therefore, although LTRs can be processed asymmetrically, a synaptic complex 
involving both LTRs must form to allow strand transfer, ensuring that only legitimate 
integration of both retroviral cDNA ends occurs. Based on a comparison to Mu phage 
transposase and available crystal structure data, it has been suggested that the active form 
of retroviral IN is a tetramer (a dimer of dimers) (Wang et al., 2001). The results of this 

20 research show that HIV IN expressed in human cells is indeed present as a stable 
tetramer. Recombinant HIV-1 IN as well has been recently reported to form stable 
tetramers in diluted solutions (Leh et al., 2000). Intriguingly, both Mu phage and Tn5 
transposases form active multimers (tetramers and dimer respectively) only within their 
synaptic complexes. Thus, independent transposase subunits must first bind to the ends of 

25 the transposon genome, before being brought together to form the synaptic complex. 
Extrapolating this scheme to the retroviral PIC assembly, the stable IN tetramers can be 
looked at as such independent subunits, which first have to bind to one LTR end each 
before interacting with each other. According to this model, each individual tetramer 
would be capable of carrying out 3'-processing reaction, whereas a dimer of tetramers 

30 would be necessary to accomplish the strand transfer. The data presented here show that 
the cellular HIV-1 IN 61 A complex contains two IN tetramers that are close enough to be 
cross-linked to each other. This dimer of tetramers could represent the configuration of IN 
required for concerted integration. 
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What is the role ofInip76 in retroviral replication? 

The human Inil/hSNF5/BAF47 protein has been shown to interact with HTV-1 IN in a 
yeast two hybrid screen and in vitro. Neither co-immunoprecipitation of Inil with IN 

5 from nuclear extracts was observed; nor the co-localization of the two proteins in the cell 
was detected. Therefore, it can be stipulated that the cellular EN-Inil interaction might be 
transient and only occurring during certain steps of viral replication. The present 
invention shows that IN complexes extracted from the cell lines stably expressing HIV-1 
IN contained another human protein, Inip76, identical to LEDGF/DFS70/p75. JN f and 

10 Inip76 co-localized in the nuclei of 293T-IN s alaFLAG cells, confirming that the observed 
interaction was not an artifact of the extraction. So far, there has been no previous account 
of a link between LEDGF/DFS70/p75 and retroviral replication. No obvious relation 
seems to exist between this protein and known chromatin remodeling factors. 
Based on sequence similarity, Inip76/LEDGF/DFS70/p75 is a member of the hepatoma- 

15 derived growth factor (HDGF) family that includes HDGF and several other HDGF- 
related proteins (HRPs) (reviewed in (Dietz et al., 2002)). High degree of homology 
exists between the amino terminal regions of these proteins. The PWWP motif (70 
residues containing the Pro-Trp-Trp-Pro core sequence) is located within the amino 
terminal homology region of HRPs and relates them to a larger and functionally diverse 

20 nuclear protein family, that includes DNA-binding transcription factors and enzymes 
involved in DNA repair and DNA methylation. PWWP domains are thought to be 
implicated in protein-protein interactions (Stec et al., 2000). 

An eye-catching feature of Inip76 is the abundance of the charged amino acids 
comprising approximately 40% of the total residues. Screening its sequence against the 

25 BLOKS+ protein motif database (Henikoff et al., 1999) revealed fragments with 
similarity to the HMG-I(Y) DNA AT hook sequence (data not shown). LEDGF has been 
shown to be a DNA-binding protein with affinity for heat shock and stress-related DNA 
elements (HSE and STREs) (Singh et al., 2001). It has also been reported to interact with 
components of the general transcription machinery and with the transcription activation 

30 domain of VP 16 (Ge et al., 1998). Recent reports suggested that LEDGF plays an 
important role in regulating expression of the stress-response genes (Fatma et al., 2001; 
Shinohara et al., 2002). As mentioned above, alternative splicing of LEDGF pre-mRNA, 
allows expression of the second protein, p52, from the same gene. A growing body of 
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evidence sugge^^at p75 and p52 may have different fimctim^Mthough they both can 
interact with PC4, VP 16 and general transcription factors, at least in vitro, p52 displays 
higher transcription activation activity (Ge et al., 1998). In addition, p52 and not p75 has 
been shown to functionally interact with the ASF/SF2 splicing factor in vitro (Ge et al, 
5 1998). The proteins also differ in their nuclear distribution patterns (Nishizawa et al., 
2001). Intracellular levels of p52 appear to be much lower than those of p75, at least in 
the cell lines we have tested (HEK-293, 293T, HeLa, CEM) (data not shown). We have 
not detected co-immunoprecipitation of p52 with IN from the nuclear cell lysates of IN- 
expressing 293T cells. However, it remains to be determined whether p52 is able to 
10 interact with HTV IN. 



What is the role of the Inip76-IN interaction in the retroviral life cycle? 

The fact that recombinant Inip76 was able to stimulate HIV-1 IN activity in vitro shows a 

direct involvement of Inip76 in the integration process. As a chromosome-associated IN- 

15 binding protein, Inip76 could serve as a docking factor for the PIC. It might thus be 
functionally similar to the Mu phage transposition co-factor MuB that associating with 
the acceptor DNA makes it a preferred target for transposition. It is also conceivable that 
Inip76 might be a part of the retroviral PIC. The experiments show that Inip76 is an 
essential factor in the HIV-replication and therefore Inip76 constitutes a novel target for 

20 anti-retroviral therapy directed against its interaction with IN. Alternatively, a therapeutic 
strategy based on modified Inip76 protein, designed to capture the viral IN in a 
catalytically quiescent complex, is also part of the invention. 



25 Examples 

Methods 

Example 1 Recombinant DNA 
30 The HIV-1 integrase expression constructs were based on the episomal pCEP4 vector 
(Invitrogen, Groningen, The Netherlands). The plasmid pCEP-EsPala is identical to the 
published pCEP-IN s plasmid (Cherepanov et al., 2000) with the difference that the Gly 
codon in the second position of the synthetic open reading frame (ORF) was mutated to 
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Ala. As a result, t^^onstruct expressed native HIV-1 IN with^^ddition of Met- Ala 
dipeptide at the N-terminus. To create the FLAG epitope-tagged IN expression construct 
pCEP-DS^alaFLAG, the IN synthetic ORF from pCEP-IN^ala was amplified in two 
consecutive steps with the sense primer S'-GGCTAGATATCACTAGCAAC 
5 CTCAAACAG plus two anti-sense primers 5 * -GTCGTCCTTGTAATCGCCGTCC 
TCATCTTGACGAGAG and 5 '-GGCGCTCGAGTTACTTGTCATCGTCGTCCTTGT 
AATCGC, the resulting PGR fragment was digested with Xhol and cloned between the 
EcoRI (blunt) and Xhol sites of the pCEP4 vector. This plasmid expressed HIV-1 IN 
carrying the C-terminal FLAG epitope (DYKDDDDK). The plasmid pRP1012, for 

10 bacterial expression of His 6 -tagged HIV-1 IN, was a gift of Dr. R. Plasterk (The 
Netherlands). To obtain pCP6H75, the plasmid used for bacterial expression of His 6 - 
tagged Inip76/LEDGF, the PCR fragment amplified from a HeLa cDNA with the primers 
5 '-GGCCGGATCCGACTCGCGATTTCAAACCTGGAGAC and 5'-CCGCGAATTCT 
AGTTATCTAGTGTAGAATCCTTC was digested with BamHI and EcoRI and cloned 

15 between the BamHI and EcoRI sites of pRSETB (Invitrogen). To prepare the mini-HIV 
IN substrate, the plasmid pU3U5 (Cherepanov et al., 1999) was digested with Seal. 



Example 2: Cell Culture 

The human embryonic kidney cells expressing SV40 large T antigen, 293T were obtained 
20 from Dr. O. Danos (Evry, France). Cells were grown in Dulbecco's modified Eagle's 
medium (DMEM) (Invitrogen) supplemented with 10% fetal calf serum (FCS), 2mM 
glutamine and 20 (ig/ml gentamicin at 37°C in 5% C0 2 humidified atmosphere. To 
establish stable cell lines, 293T ceils were transfected by electroporation with integrase 
expression constructs and selected with 200 |xg/ml of hygromycin B (Invitrogen). The 
25 selected cell lines, 293T-IN s ala and 293T-IN s alaFLAG, expressed IN at the levels similar 
to the previously reported 293T-IN 5 cells (Cherepanov et al, 2000) as determined by 
western blotting and immunofluorescence. IN expression was stable for over 40 passages. 
For radioactive immunoprecipitation experiments cells were labeled in 
methionine/cysteine-free DMEM (Invitrogen) supplemented with 10% dialyzed FCS plus 
30 0.1 mCi/ml of TRAN 35 S-LABEL (ICN Biomedicals, Asse-Relegem, Belgium) for 
24 hours. 
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Example 3: Preparation of Nuclear Extracts 

293T-IN s ala or 293T-IN s alaFLAG cells grown to a confluency of 80-90% were 
dissociated from plastic dishes using trypsin, washed with PBS and re-suspended in the 
5 modified cytoskeleton buffer (10 mM Pipes [pH 6.8], 10% (w/v) sucrose, 1 mM DTT, 
1 mM MgCl 2 plus the EDTA-free protease inhibitor cocktail (Roche, Brussels, Belgium)) 
(Fujita et al., 1997) containing 100 mM NaCl (referred to as 100CSK buffer). Cells were 
lysed for 10 min on ice with 0.5% NP40; nuclei were pelleted and washed with 100CSK. 
To extract IN, nuclei were re-suspended in the 400CSK buffer (same as 100CSK, but 
10 containing 400 mM NaCl) and left on ice for 5 min, the chromatin was removed by 
centrifugation at 7,500 rpm for 2 min. The total protein content of the nuclear extracts 
was measured using the BCA protein assay (Pierce, Rockford, EL), with bovine serum 
albumin (BSA) as the standard. 

15 Example 4: Chemical Cross-linking 

The nuclear extracts were diluted using 400CSK buffer to adjust the total protein 
concentration to 100, 20 or 4 |ag/ml. DTSSP (Pierce, Erembodegem-Aalst, Belgium) was 
dissolved in water immediately prior to the experiment and used at 2, 0.5 or 0.1 mM. 
Cross-linking reactions were allowed to proceed for 15 min at room temperature, were 

20 terminated by addition of Va volume of the 4 x SDS sample buffer (200 mM Tris HC1 [pH 
6.8], 4% SDS and 40% (v/v) glycerol) and incubation at room temperature for 20 min. 

Example 5: Gel Filtration Chromatography 

Nuclear extracts and immunopurified BSf f were fractionated on a Superdex 200 HR 10/30 
25 gel filtration column (Amersham-Pharmacia). The 400CSK buffer was used in all 
chromatography experiments. The column was operated at 0.6ml/min, 4°C, and 
calibrated using low and high MW gel filtration standards from Amersham-Pharmacia 
(blue dextran; thyroglobulin, R s /Mw 85 A/669 kDa; ferritin, 61 A/440 kDa; catalase, 
52.2 A/232 kDa; aldolase, 48.1 A/158 kDa; BSA, 35.5 A/67 kDa; chymotrypsinogen A, 
30 20.9 A/25 kDa). The sample volume was kept at 200 jil; fractions of 300 yl were 
collected and analyzed by western blotting using polyclonal anti-IN antibodies. When 
necessary, gel filtration fractions were concentrated by precipitation with trichloroacetic 
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acid. Stokes radii (^^ind approximate MWs of the IN complex^^^re determined from 
their experimental partition coefficients (Ka V ) as described (Siegel and Monty, 1965). 



Example 6: Western Blotting and Immunoprecipitation 
5 The gradient 4-12 and 4-20% Novex Tris-Glycine gels were purchased from Livitrogen. 
Proteins were transferred onto PVDF membrane (Bio-Rad, Nazareth, Belgium); detection 
was done with ECL+ (Amersham-Pharmacia, Roosendaal, The Netherlands). The rabbit 
polyclonal anti-HIV-1 IN were home made and used at a final dilution of 1:30.000 
(Cherepanov et al., 2000). The anti-FLAG M2 monoclonal antibody was from Sigma- 

10 Aldrich, the monoclonal anti-DNA-PKcs Ab-4 cocktail from NeoMarkers (Fremont, CA) 
and the monoclonal anti-LEDGF p75/p52 from BD Biosciences (Erembodegem, 
Belgium). The affinity purified anti-hMCM3 polyclonal antibody was a gift from Dr. 
Knippers R. (Universty of Konstanz, Germany). A combination of prestained MW 
markers (New England Biolabs 9 Hertfordshire, UK) and Markl2 (Invitrogen) was used to 

15 estimate MWs of the cross-linking products and Inip76. DNAPKcs detected in a 293T 
nuclear lysate sample using the anti-DNAPKcs Ab4 antibody served as the 470 kDa 
marker in some western blots. 

In the initial immunoprecipitation experiments, 30 \x\ of protein G agarose 
(Roche) and 1-3 jig of the anti-FLAG M2 antibody was added to the nuclear extracts 

20 prepared in 400CSK and diluted to obtain total protein concentration of 200 |ig/ml; the 
suspension was strirred at 4°C overnight (12-18 hours). The agarose beads were washed 
once with 400CSK and 4 times with 100CSK plus 0.1% NP40. Protein was eluted in 
400CSK buffer by addition of 200^ig/ml FLAG peptide (Sigma- Aldrich) or in 
SDS PAGE sample buffer. To purifiy LEDGF-INf complexes, immunoprecipitation was 

25 carried out using undiluted nuclear extracts (700 jag/ml total protein) for 3-5 hours. To 
identify the Ihip76 protein by N-terminal sequencing and mass spectrometry, the 
procedure was upscaled. 293T-IN s alaFLAG cells were grown to confluency on five 
500 cm 2 dishes (VWR International, Leuven, Belgium), lysed with 0.5% NP40. IN 
complexes were extracted from the nuclear pellets into 13 ml of the 400CSK buffer and 

30 incubated with 300 jal of protein G agarose beads and 40 \ig of the anti-FLAG M2 
antibody for 4.5 hours. The IN f complexes were eluted in 700 j-il of the 400CSK buffer 
with 200 jig/ml FLAG peptide. 
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Example 7: Amm^erminal Sequencing and Mass Spectrome^^ 

Immunopurified INf-Inip76 complexes were precipitated with trichloroacetic acid and 
redissolved in the SDS PAGE sample buffer. Approximately 3 |ig of the Mp76 protein, 
electroblotted onto a Sequi-Blot PVDF membrane (Bio-Rad) from an SDS PAGE gel, 

5 was subjected to Edman degradation on a pulsed liquid phase Procise491cLC protein 
sequencer (Applied Biosystems, Lennik, Belgium). For mass spectrometry analysis the 
coomassie blue-stained band of Inip76 was cut from an SDS PAGE gel, destained in a 
200 mM ammonium bicarbonate/50% acetonitrile, air-dried and soaked in 8 |il trypsin 
solution (16 ng trypsin (Promega) in 50 mM ammonium bicarbonate) on ice for 20 min. 

10 Following overnight digestion at 37°C, the supernatant was recovered, and the gel slice 
was extracted twice using 60% acetonitrile/0.1% formic acid. The extracts and the 
supernatant were pooled and dried in a Speedvac concentrator. The peptides were re- 
dissolved in 0.1% formic acid and analysed by on-line nanoflow high performance liquid 
chromatography tandem mass spectrometry (LC/MS/MS) on an UltiMate capillary LC 

15 system (LC-Packings, Amsterdam, The Netherlands) coupled to a Q-Tof mass 
spectrometer (Micromass, Manchester, UK) equipped with an electrospray ionization 
source. Technical details of this system are reported elsewhere (Devreese et al., J. 
Chromatography A, in press). All spectra were processed using the MassLynx and 
MaxEnt software delivered with the mass spectrometer. 

20 

Example 8: Indirect Immunofluorescence Microscopy 

293T-IN s alaFLAG cells grown in Lab-Tek II glass chamber slides (VWR International) 
were fixed with 4% formaldehyde in phosphate buffered saline (PBS) for 10 min and 
permeabilized in ice-cold methanol. The cells were further blocked in PBS supplemented 

25 with 20 mM ammonium chloride and 10% FCS and incubated with rabbit polyclonal anti- 
FLAG antibodies (diluted 1:10.000 in PBS/10% FCS) (Sigma-Aldrich) and monoclonal 
anti-LEDGF (1:300) or anti-DNA PKcs (1:300) followed by Alexa-555 anti-rabbit and 
Alexa-488 conjugated anti-mouse IgG antibodies (Molecular Probes, Leiden, The 
Netherlands). The nuclear DNA was labeled with 5 pM To-Pro3 iodide (Molecular 

30 Probes). Confocal laser scanning fluorescent microscopy and imaging was carried with an 
LSM510 system (Carl Zeiss, Zaventem, Belgium) using a 488 nm Ar ion laser with a 
505-530 nm band pass filter for Alexa-488, a 543 nm HeNe laser with a 565-615 nm filter 
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for Alexa-555 and^^3 nm HeNe laser with a low pass 650 nr^^er for ToPro-3. All 
acquisitions were done in the multi-track mode. 



Example 9: Recombinant proteins 
5 The Hise-tagged HIV-1 IN was produced from the pRP1012 plasmid in the Endo I-free 
host Escherichia coli strain PCI (BL21(DE3), AendA::Tc R , pLysS) (Cherepanov et al., 
1999). The protein was purified from the soluble fraction by chromatography on Ni-NTA 
agarose (Qiagen, Hilden, Germany) and Heparin Sepharose (Amersham Pharmacia, 
Uppsala, Sweden) in the presence of 7.5 mM 3-[(3- 

10 cholamidopropyl)dimethylammonio]propanesulfonate (CHAPS) (Sigma- Aldrich). The 
His6-tag was removed by incubation of the purified protein with thrombin (Novagen). 
The His6-tagged Inip76 was expressed from the plasmid pCP6H75 in PCI by induction 
with 1 mM isopropylthiogalactopyranoside (IPTG) at 29°C for 3 hours in LB medium. 
The soluble protein was then purified by batch adsorption to Ni-NTA agarose and 

15 chromatography on a 1 ml HiTrap Heparin Sepharose column (Amersham Phamacia). 
The protein was eluted from the Heparin Sepharose column using a linear NaCl gradient 
in 30 mM TrisHCl [pH 7.0]. Peak fractions collected at approximately 800 mM NaCl 
were pooled and concentrated using Centricon-30 (Millipore, Brussels, Belgium) to 
obtain the final protein concentration of 1 mg/ml. The protein was estimated to be at least 

20 95% pure by SDS PAGE. 
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Example 10 D^^tion of the nuclear and chromosomal lo£bation of HIV IN usin2 
small interfering RNA molecules specific to the Iniv76 mRNA. 

RNA interference with synthetic short RNA duplexes has been used to knock-down 
5 expression of endogenous genes in mammalian cells (Elbashir et al., 2002). We used 
short interfering RNA (siRNA) duplexes to study the effect of depletion of Inip76 in 
293T-INsalaFLAG cells on the distribution of HTV-1 IN 



i): siRNA Preparation 

10 All oligonucleotides for siRNA preparation were chemically synthetized by 

Dharmacon (CO, USA) and Xeragon (MD, USA) and were composed of ribonucleotides 
(A,U,G and C) plus a pair of 3 '-terminal desoxy-thymidines (dTdT). 

Small interfering RNA molecules (siRNA) were prepared by annealing the 
following pairs of oligonucleotides: 

15 i) CAGCCCUGUCCUUCAGAGA-dTdT plus UCUCU G AAGGAC AGGGCUG-dTdT, 
to obtain 76A RNA; 

ii) AGACAGCAUGAGGAAGCGA-dTdT plus UCGCUUCCUCAUGCUGUCU-dTdT 5 
to obtain 76B RNA; 

iii) CAGAUGCAUUGAGGCCUUG-dTdTplus CAAGGCCUCAAUGCAUCUG-dTdT, 
20 to obtain 76C RNA; 

iv) GCGCGCUUUGUAGGAUUCG-dTdT plus CGAAUCCUACAAAGCGCGC-dTdT, 
to obtain NC RNA. 

The 76A, 76B and 76C siRNA molecules are thus designed to be specific for the Inip76 
mRNA and contain an anti-sense strand that can hybridize to the target mRNA. The NC 
25 molecule is an RNA duplex which is not specific to Inip76 RNA nor to any other known 
human mRNA. 



it Transfection of the 293 T-IhfalaFLA G Cells with the siRNA Molecules 
293T-EN s alaFLAG cells expressing FLAG-epitope tagged HIV-1 IN were seeded 
30 into 8 well Nunc LabTekll chamber glass slides (VWR International) and were 
transfected at a confluency of about 25% with each siRNA using GeneSilencer, a 
liposomal transfection reagent purchased from GTS (Gene Therapy Systems, CA, USA). 
Transfection was done according to the manufacturer's recommendations. Briefly, for 
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transfection of eac^^ll, 0.3 |ig of RNA and 1.75 jil of the Gt^feencer reagent were 
used to transfect each well of the slide. 

Expression and distribution of IN and Inip76 was assessed by indirect 
immunofluorescence. Two days post transfection the cells were fixed with 4% 
5 formaldehyde in PBS for 5 minutes, followed by four washes in PBS and treated with ice- 
cold methanol for 5 minutes. Fixed and permeabilized cells were then incubated in 
blocking solution (10% fetal calf serum/10 mM ammonium chloride in PBS) for 30 
minutes at room temperature. The cells were incubated with mouse monoclonal anti- 
human LEDGF antibody (purchased from BD Bioscience, KY, USA) diluted 1:200 in 

10 blocking buffer or monoclonal anti-FLAG M2 (Sigma-Aldrich) diluted 1 :200 for 1 hour. 
Non-bound primary antibodies were washed in three changes of blocking buffer. Cells 
were stained for 1 hour with Alexa-488 conjugated goat anti-mouse antibody (Molecular 
Probes) diluted 1:300 in blocking buffer and washed with blocking buffer. Chromosomal 
DNA was stained by addition of 5 \xM ToPro3 iodide (Molecular Probes) to the 

15 secondary antibody solution. 

Western blot analysis demonstrated that the expression levels of HIV- 1 IN in the 
cells transfected with 76A siRNA were reduced dramatically (not shown), suggesting that 
Inip76 is necessary for stability of the IN protein in human cells, likely protecting IN 
20 from proteolysis. . 



Example 11. Recombinant Inip76-IN Complex Produced in Escherichia coll 

25 As elaborated above, recombinant Inip76 is a potent stimulator of HIV IN activity 

in vitro. We wanted to reconstruct the Inip76-IN complex from recombinant proteins, as 
such complex can be potentially used to study interactions of the proteins on molecular 
level as well as an enzyme for DNA recombination in vitro and in vivo. Biochemical 
studies on HIV IN as well as other retroviral INs suffer from poor solubility of the 

30 proteins. We found that adding purified recombinant Inip76 to IN solutions had a 
solubilising effect on the IN protein. Thus, when both proteins were mixed at a 
concentration of 0.25 mg/ml in 150 mM NaCl/10 mM Hepes/1 mM MgCl 2 /0.1 % NP40. 
Approximately 50 % of the IN protein remained soluble after 45 minutes incubation at 
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4 °C and centri^J^on at 20000 g for 5 minutes. In contrast,^^ about 5 % of IN was 
recovered in soluble form when Inip76 was omitted from the mixture. This result 
indicates that the complex between IN and Inip76 is more soluble than free recombinant 
IN, and that such complex can be formed using recombinant proteins. A soluble form of 
retroviral integrase in complex with its chromosomal receptor might help to determine the 
structure of the active form of the retroviral IN. In addition, if active such complex could 
be used as an enzyme for in vitro DNA recombination. 

i. Construction of the Inip76 Expression Plasmid pCP-Nat75 



The open reading frame of Inip76/LEDGF was PCR amplified from pCP6H75 using 
the primers 5 '-TGACTCGCGATTTCAAACC and 5 5 -CCGCGAATTCTAGTTATC 
TAGTGTAGAATCCTTC. The resulting DNA fragment was digested with EcdRl and 
subcloned between Ndel (treated with T4 DNA polymerase to obtain blunt terminus) and 
15 EcoRI sites of the pRSETB vector (Invitrogen). The complete Inip76 open reading frame 
and the phage T7 promoter region of the resulting plasmid, pCP-Nat75, was sequenced to 
confirm that no mutation occurred. The plasmid was transformed into the E. coli PCI 
strain (E. coli BL21(DE3), AendA::Tc K , pLysS) (Cherepanov et al., 1999) by standard 
methods. The resulting strain, PC2LEDGF expressed Inip76 upon induction with PTG. 



ii. Construction of the EN Expression Plasmids pKB-IN6H and pKB-Al 5IN6H 



To construct the plasmid for bacterial expression of HIV- 1 IN with C-terminal His 6 
tag, we amplified the full IN open reading frame using PCR with the primers 5'-GCGCG 

25 TCGACATCCTCATCCTGTCTAC and 5 '-AATACGACTCACTATAGGG from the 
pINSD plasmid (obtained from The NIH AIDS Research and Reference Reagent 
Program, catalog #2820). The RCR fragment was digested with Ndel and Sail and 
subcloned between Ndel and Sail sites of pET-20b(+) (Novagen). The open reading frame 
of EST as well as the T7 promoter region of the resulting plasmid pKB-IN6H was 

30 sequenced to verify absence of mutations. When the plasmid was transformed into PCI 
bacteria, it expressed HIV-1 C-terminally His 6 -tagged IN. However, pKB-IN6H is not 
able to stably co-exist with pCP-Nat75, as both plasmids have the same type of 
replication origin. To obtain pKB-A16IN6H, an IN-expression plasmid compatible with 
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pCP-Nat75, we inseSd the BglWDraJR fragment of pKB-IN6lWetween BamHl and 
Bsal sites of pACYC177 vector (Chang and Cohen, 1978) (available from New England 
Biolabs) (the Drain and Bsal ends of the DNA fragments were treated with T4 DNA 
polymerase to obtain ligatable blunt termini). 

iii. Co-expression of Inip76 and His 6 -tagged HIV-1 IN in Escherichia coli 

The plasmid pKB-A15IN6H was transformed into the PC2LEDGF E. coli strain 
(BL21(DE3), AendA::Tc K , pLysS, pCP-Nat75) by standard methods. After 
transformation, bacteria harboring the newly transformed plasmids were selected on LB 
agar plates containing 120 ^g/ml ampicillin, 40 jig/ml kanamicin and 15 ^ig/ml 
tetracycline at 28 °C. This procedure resulted in introduction of the pKB-A15IN6H 
plasmid and knock-out of the incompatible pLysS plasmid. The resulting strain 
PC2LEDGF-C (BL21(DE3), AendA::Tc K pCP-Nat75, pKB-A15IN6H) was grown on LB 
agar plates and in liquid LB medium supplemented with 120 jig/ml ampicillin, 40 jig/ml 
kanamicin and 15 |ig/ml tetracycline at 28 °C Expression of Inip76 and IN was induced 
by addition of 1 mM IPTG at 29 °C in LB medium. 



iv Purification of the Inip76-IN Complex Expressed in E. coli 



An overnight culture of PC2LEDGF-C (100 ml) was diluted with 41 of fresh LB 
medium supplemented with 120 \xgjm\ ampicillin and 40 |ig/ml kanamicin and grown in a 
shaking incubator at 28 °C until its optical density (measured at 600 nm) reached 
approximately 0.8, then the culture was induced with 1 mM IPTG. After 3 hours 
expression at 29 °C the bacteria were collected by centrifugation and kept frozen 
overnight at -70 °C. Next day defrozen bacteria were re-suspended in 20 ml of cell 
breaking buffer (500 mM NaCV30 mM Hepes/1 mM MgCl 2 /0.1 mM PMSF) and lysed by 
two passages through a French press at a cell pressure of 18000 psi. The resulting 
bacterial lysate was cleared by centrifugation at 15000 g for 30 min supplemented with 25 
mM imidazol and mixed with 1.5 ml of Ni-NTA resin (Qiagen) equilibrated with the 
same buffer. The slurry was gently mixed for 30 minutes to allow His-tagged protein to 
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washed with 50 ml of 400 mM NaCl/30 mM Hepes/1 mM MgCl 2 /25 mM imidazol, 
pH7.2 followed by 12 ml of 400 mM NaCl/30 mM Hepes/1 mM MgCl 2 /35mM 
imidazol, pH7.2. The bound protein was then eluted in the elution buffer (400 mM 
5 NaCl/30 mM Hepes/1 mM MgCl 2 /200 mM imidazol, pH 7.2). Both IN and Inip76 eluted 
in the presence of 200 mM imidazol, as can be seen from the SDS-PAGE gel shown in 
FIG. 9A. In a control experiment, carried-out with the lysate of induced PC2LEDGF, 
Inip76 was detected only in the non-bound fraction, and no Inip76 was detected in the 
fractions eluted with 200 mM imidazol (FIG. 9B). Therefore, as was expected, in the 

10 absence of His6-tagged IN, Inip76 does not bind to Ni-NTA confirming that all Inip76 
found in the samples is indeed associated with IN. 

The fractions containing IN and Inip76 were pooled together and injected into a 
1 ml HiTrap SP Sepharose column (Amersham-Phaimacia) equilibrated with 35 % 
chromatography buffer B in A (buffer A was 1 mM MgCla/30 mM Hepes, pH 7.2, buffer 

15 B was 1 M NaCl/1 mM MgCl 2 /30 mM Hepes, pH 7.2). The column was washed with 
35 % B at 0.7 ml/min and the bound proteins were eluted in a 15 ml linear gradient from 
35 % B to 90 % B. The host-derived contaminating proteins, including the 70 kDa band, 
as well as free IN protein were found in the flow-through fraction (FIG. 9C). The Inip76- 
IN complex eluted in approximately 55 % B, corresponding to 550 mM NaCl (fractions 

20 16-22 in FIG. 9C). The fractions containing Inip76 and IN were pooled and concentrated 
by ultrafiltration using Centricon-30 (Millipore) to a final volume of approximately 70 pi 
(FIG. 10A). The final protein concentration was determined to be 4.8 mg/ml using BCA 
kit (Pierce) and BioRad protein assay (BioRad) with bovine serum albumin as a standard. 
The protein was kept at 4 °C for a week without signs of aggregation. In contrast, free 

25 HIV IN was not soluble at these concentrations in buffers with physiological pH. 
Detergents such as CHAPS or NP40 were needed to keep the protein from aggregation 
even at protein concentrations below 1 mg/ml. 

Example 12: His Tazlnteerase Pull-down Assay 
30 Binding of IN to LEDGF/p75 was assayed in 25 mM Tris-HCl, pH 7.4, 0.1% Nonidet P- 
40, 20 mM imidazole containing 100 or 400 mM NaCl, in the presence of 1 mM MgCl 2 
(binding buffer). 1 \ag of recombinant His 6 -tagged HIV-1 IN, HIV-2 DSf or HTLV-2 IN 
was incubated with 1-3 ^ig of LEDGF/p75 in 200 ^1 of binding buffer supplemented with 
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2 ug of bovine serunflnbumin (BSA). Following a 30-min incubat^Ru 4°C, the mixtures 
were centrifuged to remove aggregated protein. The samples were supplemented with 40 
ul of Ni-NTA-agarose and stirred for an additional 30 min. The agarose beads were 
recovered by centrifugation and washed with 500 ul of binding buffer. Bound proteins 
were eluted in 40 ul of binding buffer supplemented with 200 mM imidazole and 1% 
SDS and analyzed by SDS-PAGE electrophoresis followed by staining with Coomassie- 
R250. We obtained the pKP1013 plasmid, coding for HIV-2 IN, from Dr. R. P. Plasterk, 
Nederlands Kankerinstituut, Amsterdam. pHTLV2 was obtained from C. Jonsson, New 
Mexico State University, New Mexico (Wang et al, 2001). Both His-tagged proteins 
were purified using Ni 2+ NTA and Heparin cohims. 

Example 13: Stable knock-down ofLEDGF/p75 and rel ating experiments regarding HTV- 
1 replication 
Cell culture 

MOLT-4 cells, clone 8 (Kikukawa et al, 1986) were grown in a humidified atmosphere 
with 5% C0 2 at 37°C and maintained in RPMI 1640 medium supplemented with 10% 
heat-inactivated fetal calf serum, 2 mM L-glutamine, 0.1% sodium bicarbonate and 20 
ug/ml of gentamycin. 

Transduction of MOLT-4 cells with a retroviral vector for siRNA delivery. 
Molt-4 cells were transduced with the retroviral vector MSCV7U6-p75. The MSCV/u6- 
cont was used as a control. Both vectors are described in Silver and Devroe, 2002. 
Briefly, two complementary oligonucleotides corresponding to the positions 1420-1441 
(Stephane maybe you can indicate the positions of the different siRNA's on the Figure?) 
of the open reading frame of p75 (GenBank accession number AF063020) were ligated 
into the pMSCVpuro (Clontech) in order to create the MSCV/U6-p75 vector. For the 
construction of MSCV/U6-cont, an irrelevant hairpin construct was used. Forty-eight 
hours post-transduction, selection of transduced cells was started by addition of 
puromycin. The LEDGF/p75 knock-down was verified in the MSCV/U6-p75 transduced 
polyclonal MOLT-4 cells by Western blotting. 

For western blotting we used mouse monoclonal antibodies (BD Biosciensces).and the 
ECL+ chemiluminescent detection system (Amersham Biosciences) with gout anti-mouse 
antibodies (DAKO, Denmark) and horseradish peroxidase. 
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HIV infection 

Approximately 10 5 MOLT-4 cells stably transduced with MSCV/U6-p75 or with 
MSCV/u6-cont were pelleted and incubated with HIV-1 (ni B) for two hours at different 
virus dilutions. Cells were washed twice with RPMI and resuspended in 10 ml RPMI, 
5 containing puromycin at a concentration of 1 jig/ml. The cells were subcultured at 
comparable density every 2 to 3 days. Cell-free supernatans was analysed for p24 
determination by ELISA (HIV-1 p24 Core Profile ELISA, duPO?T, Dreieich, Gennany). 
Every 2 to 3 days, the cell cultures were monitored for the appearance of HIV-induced 
cytopathogenic effect. Uninfected but stably transduced cells were used as a negative 
10 control. 



Results 

15 Example 14: HIV-1 IN is present in the insoluble nuclear fraction 

The 293T-EN s ala cell line used in this work was similar to the previously reported 293T- 
IN 5 , except that it expresses HIV-1 IN with the Met-Ala dipeptide at its N-terminus 
(instead of Met-Gly) (Cherepanov et al., 2000). This change was introduced to prevent 
potential myristoylation of the protein (Boutin, 1997) and affected neither cell line 

20 stability nor IN expression levels. The integrase protein was nuclear in both cell lines as 
determined by indirect immunofluorescence microscopy (data not shown and Cherepanov 
et al., 2000). After centrifugation of the digitonin- or NP40-treated 293T-lN s ala cells, 
most of the IN protein was retained in the nuclear pellet (Figure 1A). Since NP40 
permeabilizes both the plasma membrane and the nuclear envelope, the bulk of IN present 

25 in the ceil is thus stably associated with insoluble nuclear structures. IN was extracted 
from the nuclei by increased salt concentrations; 350-500 mM NaCl was sufficient to 
elute the protein (Figure IB). Therefore, buffers supplemented with 400 mM NaCl were 
used for preparing nuclear salt extracts. 

Some chromatin-associated proteins can be eluted from detergent-permeabilized 

30 nuclei by gentle treatment with DNAsel or micrococcal nuclease (Fujita et al., 1997; 
Holthoff et al, 1998; Meller and Fisher, 1995). We observed no elution of IN when nuclei 
prepared from 293T-IN s ala cells were treated with DNasel (Figure 1C), that completely 
digested nuclear DNA to fragments of less then 200 bp (data not shown). In accordance 
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with Fujita et al. ^Fujita et al., 1997), MCM3, a chromosomaffeplication factor, was 
removed from the nuclei by gentle DNasel treatment (Figure 1C). The same salt 
concentration was required to extract IN from the nuclease-digested nuclei (data not 
shown). However, subjection of the DNase-digested nuclei to low ionic strength 
5 conditions led to efficient elution of the protein (Figure ID). Non-digested nuclei did not 
release any detectable amount of IN in the hypotonic conditions (Figure ID). These 
results indicate that IN is associated both with chromosomal DNA and with some other 
nuclear structure, which is destabilized in low ionic strength conditions. Similar results 
were obtained when 293T-IN s ala nuclei were exposed to micrococcal nuclease (data not 
10 shown). 



Example 15: Chemical Cross-linkins of IN Complexes Present in Nuclear Extracts 

We used S^'-Dithiobisfsulfosuccinimidyl propionate] (DTSSP), an amine-specific N- 

hydroxysuccinimid ester with a 12 A spacer arm, to cross-link protein complexes present 

15 in the nuclear extracts of 293T-IN s ala cells. The nuclear proteins were extracted from the 
NP40-treated 293T-IN s ala nuclei using the cytoskeleton buffer containing 400 mM NaCl 
(400CSK). The total protein concentration was adjusted to 100, 20 and 4 (ig/ml; the 
samples were incubated with DTSSP and separated in a non-reducing 4-12% SDS PAGE 
gel. IN-containing cross-linking adducts were detected by western blotting using 

20 polyclonal anti-IN antibodies. A typical result is shown in Figure 2A. In the non-cross- 
linked samples (lanes 2, 6 and 10) as well as in the samples cross-linked in the presence 
of SDS (lane 1) only IN monomer and a band corresponding to IN dimer (approximately 
60 kDa) were apparent. Addition of 0.1-2 mM DTSSP yielded cross-linked complexes of 
60 kDa (p60 cl ), 150-180 kDa (the pl50 cl band, clearly visible in lanes 7, 8 and 12), 250- 

25 350 kDa (p300 cl , lanes 7 and 8) and less resolved higher molecular weight (MW) species. 
Strikingly, detection of the cross-linked IN complexes with our polyclonal anti-IN 
antibodies was far more sensitive than detection of the non-cross-linked IN (compare the 
lanes 2 and 4, for example). Probably, some strong conformational epitopes are better 
preserved within cross-linked IN during SDS PAGE and western blotting. No unspecific 

30 bands were revealed in nuclear extracts from 293T cells before and after cross-linking 
with DTSSP confirming that all bands detected in the western blot correspond to IN- 
containing complexes (data not shown). The cross-linking of the 293T-IN s ala nuclear 
extract was clearly dependent on the concentration of both DTSSP and protein. Saturating 
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cross-linking (^feuted protein samples followed by dena^nng SDS PAGE has been 
used to determine MWs of native protein complexes (see (Corey et al, 1998) and 
references therein). Cross-linking of the diluted nuclear extract (4 \xg/ml protein) with 
2 mM DTSSP yielded a band of approximately 150 kDa (pl50 cl , lane 16), while in more 
concentrated extracts (100 ^g/ml) a diffuse band of 250-300 kDa was the most prominent 
(p300 cl , lane 8). Thus, there exist at least two different IN complexes: a large complex at 
higher protein concentrations and a smaller complex in the diluted extract. The pl50 cl 
product seems to be the result of complete cross-linking since no significant change in 
cross-linking occurs when the concentration of DTSSP was increased from 0.5 to 2 mM 
(compare lanes 15 and 16) and higher (not shown). Thus, pl50 cl probably represents the 
IN complex present in the diluted nuclear extract. Moreover, this complex is a 
dissociation product and importantly, a component of the larger complex, since it 
appeared as a partial cross-linking adduct in the reactions with more concentrated protein 
extracts at 0.5 mM DTSSP and it decreased at 2 mM DTSSP (compare lanes 7 and 8). 
However, the p300 cl band is probably not the result of complete cross-linking of the larger 
complex, since a strong smear and some less resolved bands are present above p300 cl on 
the western blot; aggregated material not able to enter 4% acrylamide gel is also evident 
(lanes 8 and 12). Some of the high MW adducts in the reactions with 100 fig/ml extracts 
may result from non-specific inter-molecular cross-linking of proteins. Cross-linking of 
IN complexes in the nuclear extracts using oxidizing Cu 2+ -[l,10-phenanthroline] 3 
complex (Cys-Cys cross-linker (Ji, 1983)) were also suggestive for the presence of a large 
protein complex that dissociated upon dilution releasing a molecule of approximately 
150 kDa (data not shown). 



Example 16: Apparent Stokes Radii of the two Nuclear Inteerase Com plexes 
To confirm the presence of both IN complexes and deduce their size we used gel 
filtration. Undiluted (550 jag/ml protein) and diluted (30 jxg/ml) nuclear salt extracts from 
293T-IN s ala cells were run on a calibrated Superdex 200 column and the IN elution was 
followed by immunoblotting the collected fractions (Figure 3A). We observed two 
distinct elution volumes corresponding to two different IN complexes. After 
chromatography of the undiluted extract, IN eluted symmetrically with a peak maximum 
in fractions 8 and 9 (elution volume, V e = 1 1.2 ml; Stokes radius, R s = 61 A, see Figure 
3B). However, IN eluted later and in a broader peak with a V c of 14.1 ml (R* = 41 A, 
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Figures 3A and 3B) when the sample was diluted prior to gel filtration. Assuming both 
complexes are globular, their MWs can be calculated to be 380 kDa and 115 kDa 
respectively (Figure 3C). The smaller dilution-resistant molecule (R* = 41 A) most likely 
corresponds to the pl50 cl cross-linked complex observed in the previous experiment, 
while partial cross-linking of the 61 A IN complex probably resulted in p300 cl . The 
effective dilution of the peak fractions on the column was 5- to 7- fold as calculated from 
calibration runs (data not shown). Thus, the sample concentrations used in our gel 
filtration experiments corresponded to those used in the cross-linking experiments 
(100 fxg/ml and 4 ^g/ml). 



Example 1 7: Purification and Characterization of FLAG-Tazzed IN Complexes 
To facilitate isolation of native IN complexes from cell extracts we modified the IN 
expression construct adding the FLAG epitope tag at the carboxy terminus of IN. The 
293T-BSf s alaFLAG cell line, obtained by stable transfection of 293T cells with the tagged 
15 expression construct, was very similar to 293T-IN s ala in stability and levels of IN 
expression (not shown). FLAG-tagged IN (BSf f ) localized predominantly in the nuclei in a 
diffuse pattern and was associated with chromosomes during mitosis (see below) as has 
been previously reported for non-tagged HIV-1 IN (Cherepanov et al., 2000). EM f could 
be extracted from the nuclei of 293T-IN s alaFLAG cells in the same conditions as for non- 
20 tagged IN. The cross-linking pattern of IN f with DTSSP was very similar to that of non- 
tagged IN (Figure 2B). The two major cross-linking products of INf showed slightly 
slower migration in SDS PAGE gels than the original pl50 cl and p300 cl , which can be 
attributed to the negative charge of the FLAG tag and increased molecular weight of the 
tagged protein. For convenience, however, we refer to the INf cross-linking adducts as 
25 pl50 cl and p300 cl . The gel filtration profiles were as observed for the non-tagged IN 
extracted from 293T-EM s ala cells (see below and data not shown). 

In initial immunoprecipitation experiments, we incubated diluted nuclear extracts 
from metabolically labeled 293T-IN s alaFLAG cells with the anti-FLAG M2 antibody and 
protein G agarose overnight. The protein isolated in this way displayed a single specific 
30 band in SDS PAGE gels migrating at the expected position for the FLAG-tagged IN 
(33.5 kDa) (Figure 4A). Isoelectrofocusing of immunoprecipitated INf in denaturing pH 
gradients showed a major band close to the predicted pi (6.5), which reacted with anti-IN 
serum in immunoblot (data not shown). When the IN f immunoprecipitated from a nuclear 
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extract of 293TWalaFLAG cells was eluted from the antfflLAG M2 antibody with 
synthetic FLAG peptide and incubated with DTSSP, the pl50 d cross-linking product was 



the immunoreactive reaction products accumulated at the top of the gel, suggesting 
5 aggregation of the protein (data not shown). We were not able to find reaction conditions 
to reproduce the p300 cl cross-linking product with INf preparations purified this way. 
Fractionation of purified INf on a Superdex column showed a peak with a Ka V value very 
close to that of the 41 A complex (Figure 4C). The fact that the 41 A complex exists in 
the purified IN f preparation and the apparent MW (115-150 kDa), based on cross-linking 

10 and gel filtration experiments, suggest that the 41 A molecule is a tetramer of IN. 
Apparently, the native 61 A IN complex was not stable enough to withstand 
immunoprecipitation under these conditions. Next, we tried shorter immunoprecipitation 
times (3-5 hours) starting from more concentrated nuclear extracts. Although the overall 
yield of INf decreased, the immunoprecipitated samples were found to contain an 

15 additional protein. It had an apparent MW of approximately 76 kDa, as determined by 
SDS PAGE (Figure 5A) and was present at variable ratios to INf in different preps. This 
protein, here referred to as Inip76 (for 76 kDa IN interactor protein), is specifically 
associated with INf, since, it could not be immunoprecipitated from the parental 293T 
cells with the anti-FLAG antibody (compare lanes 2 and 3 in Figure 5A). When undiluted 

20 nuclear extract from 293T-IN s alaFLAG cells was immunoprecipitated with anti-FLAG 
antibody for 4 hours both IN f and Inip76 bands were readily detected (Figure 5B). 
Although extending immunoprecipitation to 16 hours improved IN f recovery, the yields 
of Inip76 were greatly reduced (compare the lanes 1 and 2 in Figure 5B). Intriguingly, the 
p300 cl band, detected after DTSSP cross-linking of the nuclear salt extracts, was readily 

25 observed when the Inip76-containing DST f preparations were cross-linked with DTSSP 
(Figure 6E), suggesting that Inip76 is part of a large IN complex present in the nuclear 
extracts. 

Example 18: Identification of the Inip7 6 Protein as LEDGF/DFS70/p75 
30 By upscaling immunoprecipitation we were able to isolate sufficient amounts of Inip76 
for characterization by Edman degradation and mass spectrometry (Figure 6A). The 
amino terminal sequence obtained from Inip76 was XXRDFKPGD (the first two residues 
being not resolved due to background noise). Scanning the TrEMBL protein database for 



readily obtained (Figure 4B). When higher ENf concentrations were used in cross-linking, 
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human proteins carrying this sequence tag (Wilkins et al, 1996) resulted in four hits with 
accession numbers 095368, Q9UER6, Q9NZI3, 075475, all corresponding to the two 
alternative products of one gene: LEDGF/DFS70/p75 (lens epithelium-derived growth 
factor, referred hereafter as LBDGF) and the p52 protein ((Ge et al., 1998; Singh et al, 
2000)). Although the actual MW of LEDGF is approximately 60 kDa, it has been 
reported to migrate as a 75 kDa band in SDS PAGE gels (Ge et al., 1998). On-line liquid 
chromatography tandem mass spectrometry analysis (LC/MS/MS) of tryptic peptides 
obtained by in-gel digestion of Inip76 provided further evidence that Inip76 is indeed 
identical to LEDGF. Lens epithelium-derived growth factor (LEDGF), is a member of the 
epatoma-derived growth factor family, is found at low levels in many actively dividing 
and long lived cells. Its gene yields tow proteins, LEDGF/p75 and p52, by alternative 
splicing (Ge, H., Si, Y, and Roeder, R. G. (1998) EMBO J. 17, 6723-6729 and 2). 
LEDGF belongs to a family of homologous proteins including hepatoma-derived growth 
factor (HDGF) (Nakamura, H, Izumoto, Y, Kambe, H, et al (1994) Molecular dining of 
complementary DNA for a novel human hepatoma-derived growth factor: its homology 
with high mobility group-1 protein J Biol Chem 269, 25143-25149) and HDGF-related 
protein-1 and -2. (Izumoto, Y, Kuroda, T, Harada, H, Kishimoto, T, Nakamura, H. (1997) 
Hepatoma-derived growth factor belongs to a gene family in mice showing significant 
homology in the amino terminus Biochem Biophys Res Commun 238,26-32). 
Half of the predicted LEDGF tryptic peptides within the mass range of 1000-2500 Da 
could be identified in the sample and their MS/MS spectra readily matched LEDGF 
covering approximately 18% of its sequence (Figure 6B and Tablel). Moreover, Inip76 
strongly reacted with a commercially available monoclonal anti-LEDGF antibody (see 
below and data not shown). Most of INf present in nuclear extracts could be 
immunoprecipitated with the anti-LEDGF antibody, whereas only about 10 % of LEDGF 
could be recovered with the anti-FLAG antibody (Figure 6C), suggesting that LEDGF is 
present in an excess over INf in the extract. Non-tagged IN could also be efficiently 
precipitated with the anti-LEDGF antibody, and a fraction of the LEDGF could be 
precipitated with polyclonal anti-IN antibody in similar conditions from nuclear salt 
extracts prepared from 293T-ESf s ala cells. 
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Example 19: Inip76 is part of the 61 A HIV IN complex 

To determine whether the 61 A complex contains Inip76/LEDGF, we pre-incubated the 
nuclear salt extract from 293T-IN s alaFLAG cells with a monoclonal anti-LEDGF 
5 antibody prior to chromatography on a Superdex column. The INf elution profile changed 
dramatically: the peak eluted now near the void volume of the column (Figure 6D). 
Elution of the 61 A complex was not altered by pre-incubation of the extract with an 
unrelated mouse IgGl (Figure 6D). Predictably, elution of the 41 A IN complex (the 
presumed IN tetramer) did not change after pre-incubation of the diluted nuclear extracts 

1 0 with the anti-LEDGF antibody. 

When INf-Inip76 complex was purified by immunoprecipitation and cross-linked 
with DTSSP, the p300 cl band could be readily detected in an immunoblot with anti-IN 
antibody (Figure 6E). However, p300 cl did not react with a monoclonal anti-LEDGF 
antibody; instead a western blot with the anti-LEDGF antibody revealed two bands 

1 5 migrating at higher positions in the gel (pHMWi cl and pHMW 2 cl , Figure 6E) (the MWs of 
these molecules are too high to be determined from SDS PAGE). These results suggest 
that p300 d is a product of incomplete cross-linking of the 61 A IN-Inip76 complex. Being 
composed of IN alone, p300 cl probably represents an octamer of EST (a dimer of 
tetramers). Contacts between Inip76 and IN within the 61 A complex may be less prone 

20 to cross-linking with DTSSP than those between IN protomers. We cannot, however, 
exclude the possibility that the target epitope for the monoclonal anti-LEDGF antibody 
used is masked or destroyed within p300 cl . In addition to the major p300 cl product, a band 
at a position close to pHMWi c! is present on the anti-IN immunoblot of the purified and 
cross-linked INf-Inip76 complex (Figure 6E, lane 2). Thus, the pHMWi cl adduct is 

25 probably the smallest cross-linked IN complex containing Inip76. 

Example 20: Inip76 co-localizes with IN within nuclei of293T-IN s alaFLAG cells 
In mammalian cells, LEDGF is expressed as a nuclear protein; it is distributed in a diffuse 
manner throughout the nucleus during interphase and is stably associated with condensed 
30 chromosomes throughout metaphase and anaphase (Nishizawa et al., 2001). A similar 
subcellular distribution has been reported for HIV-1 IN (Cherepanov et al., 2000). 
Immunofluorescent detection of both IN f and Inip76/LEDGF in fixed 293T-IN s ala cells 
revealed strikingly similar intranuclear distribution patterns for both proteins (Figure 7A). 
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In accordance previous reports, both proteins were^^und to condensed 

chromosomes in mitotic cells (Figure 7B). The distribution of another nuclear protein, the 
catalytic subunit of DNA-dependent protein kinase (DNA PKcs) clearly differed from 
that of EKf (Figure 7C). In addition, DNA PKcs was excluded from condensed 
5 chromosomes in mitotic cells (data not shown). Intriguingly, the nuclear localization of 
IN f and Inip76 did not correspond to the overall DNA staining pattern, arguing against the 
possibility that the apparent co-localization of the two proteins might be merely due to 
their independent association with chromosomal DNA. In a control experiment, we 
visualized INf using a mixture of polyclonal and monoclonal anti-FLAG antibodies; the 
10 obtained two-color INf staining was similar to that of INf and Inip76 (data not shown). 

Example 21: Iniv76 is an activator of HIV-1 IN in vitro 

We have previously described activities of recombinant HIV-1 IN on the mini-HIV 
substrate, a linear 4.7 kb double stranded DNA molecule, carrying the U3 and U5 

15 terminal fragments of the viral LTR sequences (Cherepanov et al., 1999). Recombinant 
HIV-1 IN on itself was proficient in carrying-out cleavage and strand transfer using this 
long DNA substrate; however, presence of 5-12% polyethylene glycol (PEG) in the 
reaction was required for the enzymatic activity. To ascribe a possible function to the 
observed Inip76-IN interaction, we examined whether Inip76 could modulate enzymatic 

20 activity of HIV-1 IN in vitro. The mini-HIV substrate was incubated with recombinant IN 
and His 6 -tagged Inip76 in the absence of PEG, and the reaction products were analyzed 
by native agarose gel electrophoresis (Figure 6F). In the absence of Inip76, strand transfer 
products were almost undetectable (lane 3). Addition of Inip76 resulted in a potent 
stimulation of the reaction. In some conditions, approximately half of the substrate DNA 

25 was converted into various reaction products including those that were too large to enter 
the gel (lanes 7 and 8). Remarkably, both the overall efficiency of the reaction and the 
range of the strand transfer products depended on the concentration of Inip76. No 
significant variation in the yield of the strand transfer products was detected when the 
order of addition of Inip76 and IN to the mini-HIV reaction was reversed (data not 

30 shown). Although addition of Inip76 to IN reactions containing short oligonucleotide 
substrate stimulated the strand transfer activity, presence of PEG and DMSO was still 
essential for efficient Mg 2+ -dependent activity of our recombinant enzyme preparation 
(Cherepanov et al., 1997)). 
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Example 22: Disruption of the nuclear and chromosomal localization of HIV IN using 
small interfering RNA molecules specific to the Inip76 mRNA. 

5 Immunofluorescent detection of Inip76 and IN revealed that Inip76 expression 

was efficiently knocked-down in the majority of cells transfected with 76A, 76B and 76C 
molecules. Moreover, IN re-distributed from the nuclei to the cytoplasm of the transfected 
cells as illustrated in the FIG. 8B. Importantly, after knock-down of Inip76 expression, IN 
was not able to bind to chromosomal DNA, which was evident in all mitotic cells with 
10 condensed chromosomes (FIG. 8C). The distribution and expression of IN and Mp76 
remained unchanged in mock-transfected cells or cells transfected with NC RNA (FIG. 
8A). 

Example 23: Recombinant Iniv76-IN Complex Produced in Escherichia coli: Activity of 
15 the Purified Recombinant IN-Inip76 Complex 

The purified complex was supplemented with the IN mini-HIV substrate in the 
conditions similar to the mini-HIV reactions described above. The final reactions 
contained 1.5, 0.75, 0.37, 0.18 and 0 \ig of the IN-Inip76 complex, 150 ng of the mini- 

20 HIV substrate in the final volume of 20 jal. The reactions also contained 110 mM NaCl, 
5 mM MgCl 2 , 5 mM DTT, 20 mM Hepes, pH 7.5. After 90 min incubation at 37 °C, the 
reactions mixtures were supplemented with 0.2 % SDS, 2 mM EDTA plus 5 \xg of 
proteinase K and further incubated at 37 °C for 20 min, to disrupt all protein-DNA 
complexes. The reaction products separated in 0.8 % agarose gel were detected by 

25 staining with the SybrGold stain (Molecular Probes) (FIG. 10B). The activity of the 
complex was similar to the activity of IN in the presence of separately added Inip76 as 
shown above. 

Example 24: His Tag Intezrase Pull-down assay - Recombinant LEDGF/p75 forms a 
30 stable complex with HIV-1 IN and HIV-2 IN, but not with HTL V-2 IN. 

We have previously shown that His-tagged HIV-1 IN bound to a Ni 2+ -chelating resin 
stably interacts with LEDGF/p75 in solution (Maertens et al., 2003). The specificity of 
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this assay was dJ^trated by the lack of interaction with rft GF/p52. To verify 
whether LEDGF interacts with all human retroviral integrases, we repeated this assay 
using His-tagged HIV-2 and HTLV-2 IN (Figure X). Interestingly, LEDGF/p75 
interacted with integrase of lentiviral HIV-2 but not with integrase of retroviral HTLV-2. 
We have previously shown that LEDGF/p75 increases the solubility of HTV-1 integrase; 
this is not the case with HTLV-2 IN (data not shown). These experiments suggest that 
LEDGF/p75 is a lentiviral-specific co-factor. 

Example 25: Knock-down ofLEDGF/p75 with vec tor mediated RNAi • 



Polyclonal MOLT-4-derived cell lines with stable knock-down of LEDGF/p75 were 
created using retroviral vectors that encode a small haiipin RNA targeting specifically 
LEDGF/p75 but not p52. Stable knock-down (up to 90%) of LEDGF/p75 expression was 
verified by western blotting (Figure 12A). A control MOLT-4 cell line stably expressing 
15 an unrelated hairpin was selected in parallel. Growth kinetics of both selected cell lines 
were comparable. 

The cell lines were infected with 10-fold dilutions of HTV-1 (Hie) and virus breakthrough 
was followed in time (Figure 12B). A clear delay of HTV replication was observed in the 
cell lines expressing reduced levels of LEDGF/p75 at all virus dilutions tested. 
20 Replication of HTV-1 added at an moi of 2.10" 6 was even completely abolished. These 
data validate LEDGF/p75 as an antiviral target. 
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1. The use of nucleic acids mediating RNA interference specific for mRNA of a protein of 
the hepatoma-derived growth factor family, for the modulation of the nuclear transport 

5 of lentiviral integrase. 

2. The use of nucleic acids mediating RNA interference specific for mRNA of a protein of 
the hepatoma-derived growth factor family, for the manufacture of a medicament for the 
prevention and/or treatment of a viral infection. 

10 

3. The use of a protein of the hepatoma-derived growth factor family, a homologue, a 
derivative or fragment thereof to increase the solubility of lentiviral integrase during 
crystallisation purposes, antiviral testing and viral research. 

15 4. The use of an isolated and purified polynucleotide encoding the Inip76 protein or an 
intermediate or a fragment of said protein, an allelic variant, a homologue, a portion or a 
mutation thereof, for the manufacture of an lentiviral-related diagnostic tool. 

5. The use of an isolated and purified polynucleotide encoding the Inip76 protein or an 
20 intermediate or a fragment of said protein, an allelic variant, a homologue, a portion or a 

mutation thereof, for the screening of molecules for their anti-lentiviral activity. 

6. The use of an isolated and purified polynucleotide encoding the Inip76 protein or an 
intermediate or a fragment of said protein, an allelic variant, a homologue, a portion or a 

25 mutation thereof, for the manufacture of a medicament for the prevention and/or 
treatment of a viral infection (more particularly a lentiviral infection, more particularly 
HIV). 

7. A polynucleotide encoding the Inip76 protein or an intermediate or a fragment of said 
30 protein, an allelic variant, a homologue, a portion or a mutation thereof, further 

comprising a polynucleotide which codes for a least a portion of the HIV integrase. 

8. The polynucelotide of claim 7, the combination of both polynucleotides being arranged 

in such a way that a fusion protein results after expression. 
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9. The use of the Inip76 protein, a homologue, a variant, a mutated form or a fragment 
thereof, for the manufacture of a medicament for the prevention and/or treatment of a 
viral infection (more particularly a lentiviral infection, more particularly HIV). 

10. The use of the Inip76 protein, a homologue, a variant, a mutated form or a fragment 
thereof, for the manufacture of an HIV-related diagnostic tool. 

11. The use of the Inip76 protein, a homologue, a variant, a mutated form or a fragment 
thereof, for the screening of molecules for their anti-lentiviral activity. 

12. The use of the Inip76 protein, a homologue, a variant, a mutated form or a fragment 
thereof for the crystallisation of a lentiviral-integrase. 

13. The use of a molecule which comprises a region specifically interacting with a protein 
of the hepatoma-derived growth factor family or nucleic acids encoding said protein of 
the hepatoma-derived growth factor family, for the manufacture of a medicament for the 
prevention and/or treatment of a viral infection. 



14. The use of a molecule which comprises a region specifically interacting with a protein 
of the hepatoma-derived growth factor family or nucleic acids encoding said protein of 
the hepatoma-derived growth factor family, for the modulation of the interaction of 
Inip76 with lentiviral integrase in antiviral research. 

15. The use of a molecule as in claim 13 and 14, wherein the protein of the hepatoma- 
derived growth factor family is Inip76. 

16. The use of a molecule as in claim 13 to 15, wherein said molecule is selected from the 
group comprising an antibody or any fragment thereof, a peptide, a small molecule, an 
antisense nucleic acid, an antigene compound, a ribozyme, nucleic acids mediating RNA 
interference and a variant polypeptide. 

17. The use of a molecule as in claim 13 to 15, wherein said molecule is an antibody. 
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18. The use of a^^ecule as in claim 13 to 15, wherein saicflfclecule is a nucleic acid 
mediating RNA interference specific for mRNA of Inip76. 

19. The use of a molecule as in claim 13 and 15, wherein said molecule is a ribozyme. 

20. The use of a construct mediating gene therapy specific for modifying the expression of 
Inip76, for the manufacture of a medicament for the prevention and/or treatment of a 
viral infection. 

21. A composition comprising: 

a) a molecule which comprises a region specifically interacting with a protein of the 
hepatoma-derived growth factor family or nucleic acids encoding said protein of the 
hepatoma-derived growth factor family, and 

b) one or more compounds effective in the treatment or prevention of viral infections. 

22. A method to identify molecules which comprise a region specifically binding to a 
protein of the hepatoma-derived growth factor family or nucleic acids encoding said 
protein of the hepatoma-derived growth factor family, comprising the steps of exposing 
said protein or nucleic acids encoding said protein to at least one molecule whose ability 
to suppress said protein of interacting with a retroviral integrase protein is sought to be 
determined, followed by the determination of the binding or hybridisation of said 
molecule(s) to said protein or to binding places of said protein on said integrase or to 
nucleic acids encoding said protein and monitoring the prevention or suppressing of 
retroviral replication or integration by the usage of at least one of said molecules. 

23. A method to increase the nuclear import of lentiviral integrase, comprising the use of a 
protein of the hepatoma-derived growth factor family or a analog, derivative or fragment 
thereof. 

24. A protein complex comprising a retroviral integrase and Inip76. 

25. A protein complex as in claim 24, wherein said integrase and Inip are recombinantly 
produced. 
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26. The use of Inip^Wa cellular cofactor of lentiviral integratioi]^^ 

27. The HIV-integrase, characterised in that said integrase is in its tetrameric form. 

28. The use of the HIV integrase as in claim 27 for screening purposes, crystallography and 
diagnostic purposes. 

29. The use of a protein of the hepatoma-derived growth factor family, a homologue, a 
derivative or fragment thereof for the crystallisation of lentiviral integrase. 
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